SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Gatanaga, T. 

Granger, G.A. 

(ii) TITLE OF INVENTION: Factors Altering Tumor Necrosis 

Factor Receptor Releasing Enzyme Activity 

(iii) NUMBER OF SEQUENCES: 154 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MORRISON & FOERSTER 

(B) STREET: 755 PAGE MILL ROAD 

(C) CITY: Palo Alto 

(D) STATE: CA 
(E) COUNTRY: USA
(F) ZIP: 94304-1018

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER:

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: USSN 09/081,385 

(B) FILING DATE: 014-NOV-1998 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: 

(B) REGISTRATION NUMBER: 

(C) REFERENCE/DOCKET NUMBER: 22000-20577.21 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 650-813-5600 
(B) TELEFAX: 650-494-0792
(C) TELEX: 706141 



(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4047 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

AAGCTTTTTG CTTTCCTTCC CCGGGAAAGG CCGGGGCCAG AGACCCGCAC TCGGACCAGG 60 

CGGGGGCTGC GGGGCCAGAG TGGGCTGGGG AGGGCTGGGA GGGCGTCTGG GGCCGGCTCC 120 

TCCAGGCTGG GGGCCGCCAG CTCCGGGAAG GCAGTCCTGG CCTGCGGATG GGGCCGCGCG 180

TGGGGCCCGG CGGGGCGGCC TCGGGAGGCG TCCAGGCTGC GGGAGCGGGA GGAGCGGCCG 240 

TGCGGGCGCC AGCGCCGTGG GTGGAGGTCG CCGTCCCTCC TGAGGGGCAG CCAGTGCGTT 300 

TGGGACCCGG GAGCAGAGCC CGCGCCTCCC CAGCGGCCTC CCCGGGGGTC TCACCGGGTC 360 

ACCCGAGAGC GGAGGCCCCG GCTCCGCAGA AACCCGGGGC GGCCGCGGGG AAGCAGCGCC 420 

CTCAGGCGTC GGAGGAGCCC CCAGAAGGAC CTCGCGCCTT CCCGCCGGGC TCCGACCGCC 480

TGGGTTCGGT GCGGGACGGC CCAGGCCGCC AGGACCCCCA AGCGCAGCTC AGTCTGCGGG 540

GCACGACCCA GAGGCCAGCA GCAGAGGACG GGGCCGGGGC CGGGAGAGGG CGGGGAGGGC 600

GCTCCTGGGA GGTCAAGGCC AGGGCTAGAC TTTCAGGGTC ATGGCCTGGC CCCTCATCCC 660 

CAGGGAGGTG AGGGGGCTCT GTGAGCAGAG GGGGCCCCGG TGGAGAAGGC GCTGCTAGCC 720 
AGGGGCGGGG CAGGAGCCCA GGTGGGGACT TAAGGGTGGC TGAAGGGACC CTCAGGCTGC 780

AGGGATAGGG AGGGAAGCTA GGGGTGTGGC TTGGGGAGGT GCTGGGGGAC CGCGGGCGCC 840 

CTTTATTCTG AAGCCGAATG TGCTGCCGGA GTCCCCAGTG ACCTAGAAAT CCATTTCAAG 900 

ATTTTCAGGA GTTTCAGGTG GAGACAAAGG CCAGGCCCAG GTGAAAATGT GGCAGTGACA 960 

GAGTATGGGG TGAGAACCAC GGAGAGAGGA AGTCCCCGAG GCGGATGATG GGACAGAGAG 1020 

CGGGGACCAG AATTTTTTAA AACGCATCTG AGATGCGTTT GGCAGACTCA TAGTTGTTTT 1080 

CCTTTCACGG AGAAAGTGTG GGCAGAAGCC AGCTCTAAAG CCCAGGCTGC CCAGCCTGCA 1140 

CTGGCAGAGC TGACGGAAGG CCAGGGCAGA GCCTTCCCTC CCTGTCACAG ACATGAGCCC 1200 

TGGAGATCTG GAATGAGGCA GATGTGCCCA GGGAAAGCTG ATCCGCCCCG ACCCAGGGCC 1260 

CCCCGGGTGC CCCTTTGAGC GTGGAATCGT TGCCAGGTCA TGGCTCCCTG CTATCGAACA 1320 

CCGGACACGG GTCGTGTGCT GCACCTGGCA GTTGCAGGAC CGACACCCAC AATGCCTTAA 1380 

GAGGTGATGA CTGCCTTCCA GGGGCCTGGC TGGCTGACAC TTTGCATGGC TCCTGGAGAA 1440 

GAGGGATTGA GTGGAGTCCA CGGGTCATGG CCACGTCCTG GGTGCTGCCT CTGAGGCAGG 1500 

GCCCGGCTGG GGTGAGAAGG GGCTGGAGAC AGGTTCCTGC CAGTTCAGCC TCTAACCGGT 1560 

GGTCTTCATG CCTAGGAACC CACTGGGGGC TTATGAAACT GCAGGTGGCT GAGTCCTTGC 1620 

CATGGGGTCT CTCCTTCAGG AGGTCTGGGT GGGGCCGGAG ACTGTACCCC ACAAAGGGTC 1680 

CCAGGTGAGG CGGATGTGGC CTGGCGCTGT GTGGCTCTGG ACCTAGTCCT TGGGCTTGGG 1740 

CTGGCGCCCA GGGCCTGGGC TTGAGACAGC TGTGACGCAG GCAAGCCATT TACCCCGTTT 1800 

GTGGGGACAT TACATCTTCC TAGCTTGGAA CACACAGGCA GCCAGGGTTG TTATCCACAT 1860 

TCCTCCTCCA TGTTCTTCTC TTGAGAACTT TTACCAGGTA TGTCAGGAGC TGGGCTCCAC 1920 

CAGGGAGACT CAAGTGGAAA GCCCTCATCC TTGTCCTCCA GGAGACAGGA AAACCTATGG 1980 

TTACAATTCC AGGGACAAGA GCGATGCATG TGAGGTGTGG CAAATCTCAC TGTTCAACTG 2040 

GAGAAATCAG AGACAGCTTC CTGGAGGCAG TGACACCTGG ACAGGCTTCT CCACAGGAGG 2100 

AAGCGAGTGA GAGAAGCCAA CTGGGATGGA CCCATCATGT AGGGGGAACA GTGCGCGCAG 2160 

AACCAACAAC CACCCCCACC CTAGGCCCAG AGCTCACGGA GAGAGCTGGG CCTCTCGGGG 2220 

TGACTACATA GTTCCCTGCT GGATCTTAGG TCTTGTCCTT GGGCAGCTCT GCTGAGACCT 2280 

CTATGCCTGT TCCAGGCTGC ACCAAGGTTT TGTGACTATT GGTCTGGGGT TGTTTTGCAG 2340 

CAACTGAAGT GTTCTGTTGT AAAACAGGCA CTTGATTTGC TGGAAGGAAT GCTGTTTGTT 2400 

CTTGCTGCGA CAAACATTGA GCAGCATTTA GTGGGCGGTT TATATCTTGT GGAGTAATGG 2460 

GTGTTTTTGA AGTCTGTCCT GGGTACTGCA CATTAAAAGG AATATCATTT TCTGAAACAT 2520 

TGCTATTTTC CACACCAGAA ATCATATCCT CTTGCTGGTC CATGTCTGAA GACCTTACAC 2580 

GAGAAAGTCT TAATGTAAGT TTAGTAGAGT CCTTGGATGG AGAACTAATT ATATCATACA 2640 

TTGCCGCTTT CTCACTCTGC TCTTTTTCAT CCTTGCCTAA TTTCATTTTC TTCTGCTTCT 2700 

TTTGTTTTCT TTCTGGAGAA TCTAGCAAGA TATCTGGTGG AACATCTCGA GGTGATGAAC 2760

AAGGTAGAGA CTGAGATTGT AGGATTAAAG GTGGTCTTGA GCCTTTAGGA GTTCCTTCAC 2820 

TTCCAGCAGG GGAGCATACT GGCTGTGGAG ATCTCAAGGG AAAAGATGCA GCATTCCTCA 2880 

TTGTTGAAGA ATCTCCATCG TCACTACTTA GCCTGTGCAC CATGTGTAGG TAGTCCTCAC 2940 

TTGAACCATG TCTAGGATTA TCAGCATGAT GATTAGCTGA ATTGCCAGAC AACGGACCAG 3000 

AAACTTTATT ATCATGTATG TTTCTCAAAC CACCTGCAAC AATGGGACTT GATACCGATG 3060 

CTTGTTGCAT CTGTGGATGT GTTGTGTAAC TTGAAGGATG GGAATATGGC ATGTATCCTG 3120 

CAGGGCTTTG TGGGGCGTAT GGACTAGGCA CTGGGCTATT TTGCTGTGGC ATAAATCTGT 3180 

TCCCAGAGCT TGTCTGTGGT GGCACAAACC GGCTGGAGGG GCTATGTGAG ATAGTGGTTT 3240 

GTTGATAATT GGAAGATGCA GGACTACTGT GCATGGAATT CTGAGAAAGT TTATACTGAG 3300

ACATCATCAT TCCACTTTGT ACATATCTGT TCTGCATGCT TTTCTCCCTG AAAACATTAG 3360 

GACTCCTTGC CAGGACGGCC TGCAACAAGA CTGGTATGTC ACCTTCTGGG TCATCACTGC 3420

CAAGGTTATC TTTCAACTCT ATGTGATCTG TTGATACCTG GTTGAGGCTA TGGACAAGCT 3480 

GTGAAACCAA ATTGTCATCC CTACAAGCCA AAAGGCAGTT CACCTCTTCT GCTATTCGTG 3540 

CATTAAAGAG AAGGCTCTTT GTAGTTGTAG CAGGTAAAGG AGATGGAAGA GGCAGCTGGT 3600 

TCAGGAGGTC TGTGAGACTA GCAATCCCCG CAAGAGTAGT AATGGGGACA TGGGGCATAT 3660 

CCCCATTCAT CCTGAATTTC TGGAATGGTG TTGCCTATAA AAGTACTTAG TTCAGGTGCC 3720 

AGCTGTCATT ACTTCCCATT TCCCAAACAC TGGGCGAATC GGCGTCTGAA TCCAAGGGGA 3780 

GGCCGAGGCC GCTGTGGCGA GAGACTATAA TCCGGGCCGG GAGGGGGGGC GGCTACGGCT 3840

CCTCTTCCGT CTCCTCAGTG CGGGGAACAT GTAGAGCCGG GGGGAGACCA GCCGAGAAGA 3900 

CAAATCGTTG CTTCTTCTTC CTCCTCCTCC TCCTTCTCCC ACATAGAAAC ACTCACAAAC 3960 

ACCCGACCAC GGGCCCGAGC TACCGGGGGG GCATCGCCGC GGGCCCGGGA ACCAATTCTC 4020 

CTGTCGGCGG GGGCGTCCTT TGGATCC 4047 

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 739 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GGATCCAAAG GTCAAACTCC CCACCTGGCA CTGTCCCCGG AGCGGGTCGC GCCCGGCCGG 60 
CGCGCGGCCG GGCGCTTGGC GCCAGAAGCG AGAGCCCCTC GGGGCTCGCC CCCCCGCCTC 120
ACCGGGTCAG TGAAAAAACG ATCAGAGTAG TGGTATTTCA CCGGCGGCCC GCAGGGCCGG 180 
CGGACCCCGC CCCGGGCCCC TCGCGGGGAC ACCGGGGGGG CGCCGGGGGC CTCCCACTTA 240 
TTCTACACCT CTCATGTCTC TTCACCGTGC CAGACTAGAG TCAAGCTCAA CAGGGTCTTC 300

TTTCCCCGCT GATTCCGCCA AGCCCGTTCC CTTGGCTGTG GTTTCGCTGG ATAGTAGGTA 360 

GGGACA^GG gIaTCTCGTT CATCCATTCA TGCGCGTCAC TAATTAGATC ACGAGGCATT 420 

TGGCTACCTT AAGAGAGTCA TAGTTACTCC CGCCGTTTAC CCGCGCTTCA TTGAATTTCT 480 

TCACTTTGAC ATTCAGAGCA CTGGGCAGAA ATCACATCGC GTCAACACCC GCCGCGGGCC 540 

TTCGCGATGC TTTGTTTTAA TTAAACAGTC GGATTCCCCT GGTCCGCACC AGTTCimAo. 600 

CGGCTGCTAG GCGCCGGCCG AAGCGAGGCG CCGCGCGGAA CCGCGGCCCC CGGGGCGGAC 660 

CCGCGGGGGG CACCGGGCCD CGGCCCCTCC GCCGCCTGCC GCCGCCGCCG CCGCCGCGCG 720 
CCGAAGAAGA AGGGGGAAA 739

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 233 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

CAAGAGTGGC GGCCGCAGCA GGCCCCCCGG GTGCCCGGGC CCCCCTCGAG GGGGACAGTG 60 

c£cCGCCGC GGGGGCCCCG CGGCGGGCCG CCGCCGGCCC CTGCCGCCCC GACCCTTCTC 120

CCCCCGCCGC CGCCCCCACG CGGCGCTCCC CCGGGGAGGG GGGAGGACGG GGAGCGGGGG 180 

AGAGAGAGAG AGAGAGAGGG CGCGGGGTGG CTCGTGCCGA ATTCAAAAAG CTT 233 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

GGATCCAAAG AATTCGGCAC GAGGTAGTCA CGGCTCTTGT CATTGTTGTA CTTGACGTTG 60 

AGGCTGGTGA GCTTGGAAAA GTCGATGCGC AGCGTGCAGC AGGCGTTGTA GATGTTCTGC 120 

CCGTCCAGCG ACAGCTTGGC GTGCTGGGCG CTCACGGGGT CCGCATACTG CAGCAGGGCC 180 

TGGAACTGGT TGTTCTTGGT GAAGGTGATG ATCTTCAACA CTGTGCCGAA CTTGGAGAAA 240 

ATCTGGTGCA GCACATCCAG GGTCACAGGG TAGAAGAGGT TCTCCACGAT "JCCTGAGC 300 

ACGGGGCTCT GCCCGGCCAT CGCCATCCCT GCATCCACGG CCGCCGCCGA GGCAGCCAAG 360 

GCCAGGTTCC CCGACTGGAC CGAGTTCACC GCCTGCAGGG CCGCCTGGGC CCGCGCCTGG 420 

TTGGGAGAGC TGTCGGTCTT CAGCTCCTTG TGGTTGGAGA ACTGGATGTA GATGGGCTGG 480 

CCGCGCAGCA CAGGGGTCAC CGAGGTGTAG TAGTTCACCA TGGTATTGGC AGCCTCCTCC 540 

GTGTTCATCT CGATGAAGGC CTGGTTTTTC CCCTTCAGCA TCAGGAGGTT GGTGACCTTC 600 

CCAAAGGGCA GCCCCAGGGA GATGACTTCC CCCTCCGTGA CGTCGATGGG GAGCTTCCGG 660 

ATGTGGATCA CTCTAGAGGG GACGCCTGCA CTTCGGCTGT CACCTTTGAA CTTCTTGCTG 720 

TCATTTCCGT TTGCTGCAGA AGCCGAGTTG CTGCTCATGA TAAACGGTCC GTTAGTGACA 780 

CAAGTAGAGA AAAGCTCGTC AGATCCCCGC TTTGTACCAA CGGCTATATC TGGGACAATG 840 

CCGTCCATGG CACACAGAGC AGACCCGCGG GGGACGGAGT GGAGGCGCCG GAATCCTGGA 900 

GCTAGAGCTG CAGATTGAGT TGCTGCGTGA GACGAAGCGC AAGTATGAGA GTGTCCTGCA 960 

GCTGGGCCGG GCACTGACAG CCCACCTCTA CAGCCTGCTG CAGACCCAGC ATGCACTGGG 1020

TGATGCCTTT GCTGACCTCA GCCAGAAGTC CCCAGAGCTT CAGGAGGAAT TTGGCTACAA 1080 

TGCAGAGACA CAGAAACTAC TATGCAAGAA TGGGGAAACG CTGCTAGGAG CCGTGAACTT 1140

CTTTGTCTCT AGCATCAACA CATTGGTCAC CAAGACCATG GAAGACACGC TCATGACTGT 1200 

GAAACAGTAT GAGGCTGCCA GGCTGGAATA TGATGCCTAC CGAACAGACT TAGAGGAGCT 1260 

GAGTCTAGGC CCCCGGGATG CAGGGACACG TGGTCGACTT GAGAGTGCCC AGGCCACTTT 1320 

CCAGGCCCAT CGGGACAAGT ATGAGAAGCT GCGGGGAGAT GTGGCCATCA AGCTCAAGTT 1380 

CCTGGAAGAA AACAAGATCA AGGTCATGCA CAAGCAGCTG CTGCTCTTCC ACAATGCTGT 1440 

GTCCGCCTAC TTTGCTGGGA ACCAGAAACA GCTGGAGCAG ACCCTGCAGC AGTTCAACAT 1500 

CAAGCTGCGG CCTCCAGGAG CTGAGAAACC CTCCTGGCTA GAGGAGCAGT GAGCTGCTCC 1560 

CAGCCCAACT TGGCTATCAA GAAAGACATT GGGAAGGGCA GCCCCAGGGT GTGGGAGATT 1620

S W?C^TGT CACTTGCCCT CTGGCTTGGG CTOTTTTTC TGGCTGGGGC 1680

CTGACACCAG TTTTGCCCAC ATTGCTATGG TGGGAAGAGG GCCTGGAGGC CCAGAAGTTG 1740

"kcStgtc TATCTTCCTG GCCACAGGGC TTCATTCCCA GATCTTTTCC TTCCACTTCA 1800

CAGCCAACGG CTATGACAAA ACCACTCCCT GGCCAATGGC ATCACTCTTC AGGCTGGGGT 1860

GTGCTCCCTG ACCAATGACA GAGCCTGAAA ATGCCCTGTC AGCCAATGGC AGCTCTTCTC 1920

GGACTCCCCT GGGCCAATGA TGTTGCGTCT AATACCCTTT GTCTCTCCTC TATGCGTGCC 1980 

CATTGCAGAG AAGGGGACTG GGACCAAAGG GGTGGGGATA ATGGGGAGCC CCATTGCTGG 2040

CCTTGCATCT GAATAGGCCT ACCCTCACCA TTTATTCACT AATACATTTT ATTTGTGTTC 2100

TCTAATTTAA AATTACCTTT TCATCTTGCT TGATTTTCCT TCAGCTAAAT TAGAAATTTG 2160 

TAGTTTTTCC CCTAAAAAAT TCAATGGCAT TCTTTCTTAT AAATTACATT CTCTGATTTT 2220 

CTTGTCAGCC TGCTTCAAGG AAATCCATGT GTTCAAAATG CTTGCTCGCA GTTTGCTCCA 2280 

TACCAAATGG TTGCTTAACC CAAATATCTG AGCAGCAAAT TGAGCTGATC CTTCTGGAGA 2340 

AAGTACGGTT GAACAGCCAA GACCACTGGG TAGTCGAAGA GAAGACCACA CATCCTGAAC 2400

TCCCCAGTCT GGTGTGAGGG GAGGACAGCT GATAACTGGA TATGCAGTGT TCCCAGACAT 2460 

CACTGGTCCC AAACCATTAC TTCTGCCTGC CACTGCCACA AATACAGTAG GAATGCCATC 2520 

CCCTTCATAC TCAGCTTTAA TCCTCAGAGT TTCATCTGGT CCTTTATGCG CAGATGTTAC 2580 

TCGAAGTTCA CATGGAATGC CAAAATTTCC ACAGGCCTTC TTGATTTTTT CACAGTGACC 2640 

AAGATCAGAA GTAGAGCCCA TCAACACTAC AACCCTGCAC TGACTTTCTG ATTTCAAAAG 2700 

CAACTCTACT CTCTCTGCAA CCCACTCAAA GTTTTTCTTT ACCATTTGGA GCCCTTCAGG 2760 

AGTTACTTCT TTGAGGTCCC GATAAGACTG TTTGTCTTTC TGTTGGCTTC GATCTCCTGA 2820 

TGGCCAGAGT CTCCAGGAAT CATTGTCAAT AACATCAGCA AGAACAATTT CTTTGGTGGT 2880 

TACATCAACA CCAAATTCAA TCTTCATATC AACCAGTGTA CAATTCTGGG GCAACCAGGA 2940 

TTTCTCCAGT ATTTCAAATA TAGCCTGTGT AGCATCTCGT GCCGAATTCA AAAAGCTT 2998 



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4152 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

AAGCTTTTTG TGAAAACCCT AGGATATGTC CCCTCCCTCA CCACACCCAA CCCCCCGCCC 60 

CTGCCCCAGG ACATGACGAT GCCTCACACA CACACACACA CACACATACA CACAAGGCCG 120 

TGAGCTGCAC GCAGGAACAT GGGCTGCACT CACGACAACA TTGAAAAAAT ATACATTATA 180 

TATGTACACC CGGGGCCCCC ACGTCCCCTC CCGTCCCCGC AGCCTGGCCA CACCAGGTCA 240 

CGGAGGAGGG GCCGGGGCTG CAGGACCTCA GGACTGCAAG GGCAGGAAGG GAAACAGGAC 300 

AAGAAAGGAA GGAAGTTGGA AAGGAGGGAG AAATGGGGTC CCCAGACTGA AATGGAAATG 360 

AGGTGGGGCG ATCATAAGAG AAGCAGGGAC GATGGTCCAG CTGAGGGAGC CCTGCAGAGG 420 

GGGAAAAGCT TCCCATGGAC AGGAGAGAGA AGGGAAGGGG AGAGGAGAGG GTTTCCTTCA 480 

ATCCCACCCC CAGCCCCAGC CCCAGCCCCA GCCATTGCAA TCGTCACCCT CTCCCCAACA 540 

CAGTGAGTGC TAAGGGGGCA GCTGCCATTG GGGGTAGAAA GGCAGCTGAA GTCCAGCCCA 600 

CTTTCCAACC CAGCCAGCCC CAGTGCAAGG GGCACACCAG GAGCATGACA GCCCAGAAGT 660 

GAGGGATGGG GGGCCGGGGG AGGGGCAGGG CGGACTCCAG AGGGCCCGCT GGGGTTTTGA 720 

AATGAAAGGA GGACTGGTTC TGAAGCCTCT CTCCCTCTTG GTCTCTGTGT TCCCAGAAAG 780 

TCCTTCTCCC ATGTCTGGAG TGTCTGTTTC ACCAGGGCAG AATTCCCCCT CTGCGTGGGG 840 

AGAGGTGTAG GCCTTAGTAG CGGTGTGGGG GGGTCTCGAT GATGCGTCTC TCGTCGCTGC 900 

TGGGGGAATC GGCCACCTCC GAGTCACTGC TGTCCTCATC CTCCTGCTGG CCCCCAACAG 960 

CCCCCGTCAC ACAGGACTGC CGATTCTGGT AGGACTCCAT GGGGTTCACA ATGATGGTGA 1020 

GAGCTGAGTC ATCCCAGAAG AGGTCTGGGT CCTTGGGGTC ACTGGAGGCC CCTGGAGGCC 1080 

CGCCGGCCCC TGAGACGCGG CGGTGAAGGG AATGGATGCG CACCAGGCCC AGGACGACCA 1140 

TGAGCACCAG GAAGCCCACG CACACCACAA TGATGAGGGT TGCGGCGCTG GGTATCATGG 1200 

AGTTTCTGTG GGAGCTGGCT AGGCTGTGTC CAGCCATCTC AGGCGGGGGC TGGTGACCAC 1260 

GGTGCAGGAA CTGCTGGGAG CTGAGCACGT GGCTGGGGTG GGCAACCCGG TTCATGCTGT 1320 

GCAGGACATT GACCTCCACG ATGAATTCAT TGCTGGAGTA ACGGCCATTC ATTTCCGAGC 1380 

AGGAAAGCCG GAACTTCCTG GTGTAGAGGG CAGCTCCGTG TCGCAGCCGA TAACGAGCCT 1440 

GCCTCAGGAT CTCTTCATAC ACAGTGATGC TCTCCACCCC AGCAATAGTG AGGTAGGCAG 1500 

ATGTGTTGGT GAGCTCCAGC CCCCGCTGCT GCAGAGAGGT TGTGTCCAGG AGCAGGCTTT 1560 

CCCGCTCGGG ATCCAGGTCA TCCCCCACCA GAGAAATTTC ACAGCCATCC AGGTTGTGCA 1620 

CAATCTCATC CGACATGCGT GTGTCTGTCA CTGTGCCCTG CCAACTCTCA TCCTTTTTGG 1680 

CCTCCACCTG GTGAGAAATG GAGCAGGTGA TTTGAAGATC AGGGAACAAA GGGACGCCGT 1740 

TGGTTCCCTC AAAGTCCACA GCTGGGCGGG CAAAATGAGC AGTGCCACTC AGCAGGATCT 1800 

GGGGGGCGTC AGGCTGAAGG ACGACCACGT AGCCCTCCAC TTCAGGGATG GAGACGCAGG 1860 

ACTCTTCGCT GAAGCACTTG ACAGCAGTGG TGAGGCGCAG GGGCCTGACG CCGGGCGTGG 1920 

CAAAGCGCAG AGTGTTCATG TAAGCCACAT GCTGCAGGGC ATGGTTGAAG GTCTCCACAT 1980 

CATCCCCCTC CAGGGTGAGC AGGGACTGTG ACGGGTTCAC GTGGACCTTC ATGCCTTTGC 2040 

CCAGGCTCTC GAAATCCCTA TAGTCCAGCC CCTCCCGACA TGCATAGAGG CACTCGATGA 2100 

CCTCGCGGCT CTCCAGGCGA CCTGAGCGCA CGCTGAAACC AGCCAGGTAG CCATGGAAGT 2160 

AGTGGTGGAT CGACAAAGGG TCTCCTTGGG TGGTGTCTGT ACTGTTGTCT CCCTTTTCCT 2220 

TCTCTTTGTT CTTCTCCTCA GTCCAGCAGG CCCCAATCAT GAGAGCAGGC TCCCTTCGGG 2280 

GTGGGTGGAT GAGGCCATTG TCATGGATGA GGGCAGGGTC GAAGGAGATG CCGTCGGTAT 2340 

AGAGTGTGAC TGTGGGGAAC TCGAGGTTCA GAGCGTAGTG GTGCCACTCA TCATCACAGA 2400 

CCTGCTCCAG CTTCCAGAGG AACTTGACTG GGCGGGCACT CTCAAGCAGG GGCCAGTAGA 2460 

GGAAGGCAAT CCTACAGCCG TGGACAGTCA GCGAGTAGTG AGAGAAGCCG TCCTCATTCT 2520 

GGACAGTGTT ACATACGATG GTTTCCTCTT CCTTCTTGCC CTTGTTGGGA GTTACGCCAT 2580 

GCTTCATCCA GAAGGACAGG GTGAAGTGGT CACTGAGGCT GTCCTGGGGC CCAGAGCCCA 2640 
GCCCACTGGG GCCACCCAGG GGCACCTGCA CAGCCTGGGT GCCATTGAAC CAGTAGATCA 2700

GGCTGCTGTC CTGGCTGTAG TGCACCGAGA GTCCTGCTGT CCAGTTGGCA TTGGGGCCAG 2760 

GCATGGGCAA CAGATCCACT TCCCCAGTGG CAGCACCACA GAGTTTCCGC AGCGCCCGCT 2820 

CTGAGTAGTT GTCACGGTCA CAGCCCTTGG CCACATGGCT GGTCTGCAGC TCTATGGTGG 2880 

CCTGAATGTT CCAGAGTGGT TCATCACAGG TCTCCAGGCG GATACCAGGG AACAAAGCCA 2940 

AGCTCCCAGC ACCTGGTGCA TATTCGATCC TTTTGTTCCA GCCTTGCCAG CTGGGTTTAC 3000 

AGGTGGGCTT CACCTGAATC TCCACCTCAG CATCATCTGC TGCCCGCTTC TTCCCACAGT 3060 

CATAAGCTGT CACTGTAAAC TTATAGAGCC TCTCACCACT GTACTGCAGC TTCTCTGTGT 3120 

TCTCAATGTT CCCGTCATTG TCAATGAGGA AAGGGGTGTT GGGTGTGAGA ATCTCATAGT 3180 

AGCAGATCTG GCTGTACTGG GGGGAGCAGT CACCGTCAAT GGCTTCCACC CGCAGGATGC 3240 

GATCGTACAG CTTCCCCTCT GTCACAGCCG CACGATACAG CCGTTCCACA AACACTGGGG 3300 

CAAACTCGTT CACATCGTTG ACCCGCACAT GCACAGTGGC CTTGTGGGAC TTCTTGGTGT 3360 

TGGCCCCGTC GGGGCCCTCG CCACAGTCAT AGGCCTGGAT GGTGAAGGTG TGTTCCTTCT 3420 

GGGCCTCGCA GTCCACAGGC TCCTTGGCCC GGATCAGCCC CTCTCCTGTC GCCTTGTCAA 3480 

GGATCACAGC CTCAAAGGGC ACCCCAGACC CATGGAGCCG GAAGCCGCAG ATCTCACCTG 3540 

CATAGCGCAG CGGGGCATCC TTGTCCAAGG CAAAGAGTGG TGGATTCAGT AGGACCGTGT 3600 

TGTCATTCTC CATGACGATG CCCTGGTACT CTGCCTCAAT CCATGGCTTG TGCTTGTTGG 3660 

CTTTGTTACA GGAGCAGGAC GCGAGCAGAG AGGCCAGCAG AAGGGGCAGC AGCAGGAGGG 3720 

TCATGGTGCG GCGTGGGGCA GGGCAGGGCC AGGCGTTTGC CTCCCCTGGG AGCCTCCAGC 3780 

CTGCGGATTC CACCTTGCGG GAGGGATACA GGGGGGGAAA ACCAAAATAA AACGTCAAAT 3840 

AAATTGTGTA GGAGGAGTCC AGCTTAGGAC CGGGCCAGAG CCAGGCCAGG CTCGGGGAGG 3900 

GGGCCTCTGC AGGTTCAGAG GATCACTGCT GCCACCACCG CCACCCTGGG AGCCAGTTAT 3960 

TTTGCCATGG CCTTGATTGC AACAGCTGCC TCCTCTGTCA TGGCAGACAG CACCGTGATC 4020 

AGGATCTCTT CTCCACAGTC GTACTTCTGC TCAATCTCCT TGCCAAGGTC TCCCTCAGGG 4080 

AGACGAAGGT CCTCTCGTAC CTCCCCGCTG TCCTGGAGCA GTGATAGGTA CCCATCCTGG 4140 

ATCTTTGGAT CC 4152 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

GGATCCAAAG ATTCGGCACG AGTGGCCACA TCATGAACCT CCAGGCCCAG CCCAAGGCTC 60 

AGAACAAGCG GAAGCGTTGC CTCTTTGGGG GCCAGGAACC AGCTCCCAAG GAGCAGCCCC 120 

CTCCCCTGCA GCCCCCCCAG CAGTCCATCA GAGTGAAGGA GGAGCAGTAC CTCGGGCACG 180 

AGGGTCCAGG AGGGGCAGTC TCCACCTCTC AGCCTGTGGA ACTGCCCCCT CCTAGCAGCC 240 

TGGCCCTGCT GAACTCTGTG GTGTATGGGC CTGAGCGGAC CTCAGCAGCC ATGCTGTCCC 300 

AGCAGGTGGC CTCAGTAAAG TGGCCCAACT CTGTGATGGC TCCAGGGCGG GGCCCGGAGC 360 

GTGGAGGAGG TGGGGGTGTC AGTGACAGCA GCTGGCAGCA GCAGCCAGGC CAGCCTCCAC 420 

CCCATTCAAC ATGGAACTGC CACAGTCTGT CCCTCTACAG TGCAACCAAG GGGAGCCCGC 480 

ATCCTGGAGT GGGAGTCCCG ACTTACTATA ACCACCCTGA GGCACTGAAG CGGGAGAAAG 540 

CGGGGGGCCC ACAGCTGGAC CGCTATGTGC GACCAATGAT GCCACAGAAG GTGCAGCTGG 600 

AGGTAGGGCG GCCCCAGGCA CCCCTGAATT CTTTCCACGC AGCCAAGAAA CCCCCAAACC 660 

AGTCACTGCC CCTGCAACCC TTCCAGCTGG CATTCGGCCA CCAGGTGAAC CGGCAGGTCT 720 

TCCGGCAGGG CCCACCGCCC CCAAACCCGG TGGCTGCCTT CCCTCCACAG AAGCAGCAGC 780 

AGCAGCAGCA ACCACAGCAG CAGCAGCAGC AGCAGCAGGC AGCCCTACCC CAGATGCCGC 840 

TCTTTGAGAA CTTCTATTCC ATGCCACAGC AACCCTCGCA GCAACCCCAG GACTTTGGCC 900 

TGCAGCCAGC TGGGCCACTG GGACAGTCCC ACCTGGCTCA CCACAGCATG GCACCCTACC 960 

CCTTCCCCCC CAACCCAGAT ATGAACCCAG AACTGCGCAA GGCCCTTCTG CAGGACTCAG 1020 

CCCCGCAGCC AGCGCTACCT CAGGTCCAGA TCCCCTTCCC CCGCCGCTCC CGCCGCCTCT 1080

CTAAGGAGGG TATCCTGCCT CCCAGCGCCC TGGATGGGGC TGGCACCCAG CCTGGGCAGG 1140 

AGGCCACTGG CAACCTGTTC CTACATCACT GGCCCCTGCA GCAGCCGCCA CCTGGCTCCC 1200 

TGGGGCAGCC CCATCCTGAA GCTCTGGGAT TCCCGCTGGA GCTGAGGGAG TCGCAGCTAC 1260 

TGCCTGATGG GGAGAGACTA GCACCCAATG GCCGGGAGCG AGAGGCTCCT GCCATGGGCA 1320 

GCGAGGAGGG CATGAGGGCA GTGAGCACAG GGGACTGTGG GCAGGTGCTA CGGGGCGGAG 1380 

TGATCCAGAG CACGCGACGG AGGCGCCGGG CATCCCAGGA GGCCAATTTG CTGACCCTGG 1440 

CCCAGAAGGC TGTGGAGCTG GCCTCACTGC AGAATGCAAA GGATGGCAGT GGTTCTGAAG 1500 

AGAAGCGGAA AAGTGTATTG GCCTCAACTA CCAAGTGTGG GGTGGAGTTT TCTGAGCCTT 1560 

CCTTAGCCAC CAAGCGAGCA CGAGAAGACA GTGGGATGGT ACCCCTCATC ATCCCAGTGT 1620 

CTGTGCCTGT GCGAACTGTG GACCCAACTG AGGCAGCCCA GGCTGGAGGT CTTGATGAGG 1680 

ACGGGAAGGG TCTTGAACAG AACCCTGCTG AGCACAAGCC ATCAGTCATC GTCACCCGCA 1740 

GGCGGTCCAC CCGAATCCCC GGGACAGATG CTCAAGCTCA GGCGGAGGAC ATGAATGTCA 1800 

AGTTGGAGGG GGAGCCTTCC GTGCGGAAAC CAAAGCAGCG GCCCAGGCCC GAGCCCCTCA 1860 

TCATCCCCAC CAAGGCGGGC ACTTTCATCG CCCCTCCCGT CTACTCCAAC ATCACCCCAT 1920 

ACCAGAGCCA CCTGCGCTCT CCCGTGCGCC TAGCTGACCA CCCCTCTGAG CGGAGCTTTG 1980 

AGCTACCTCC CTACACGCCG CCCCCCATCC TCAGCCCTGT GCGGGAAGGC TCTGGCCTCT 2040 
ACTTCAATGC CATCATATCA ACCAGCACCA TCCCTGCCCC TCCTCCCATC ACGCCTAAGA 2100

GTGCCCATCG CACGCTGCTC CGGACTAACA GTGCTGAAGT AACCCCGCCT GTCCTCTCTG 2160 

TGATGGGGGA GGCCACCCCA GTGAGCATCG AGCCACGGAT CAACGTGGGC TCCCGGTTCC 2220 

AGGCAGAAAT CCCCTTGATG AGGGACCGTG CCCTGGCAGC TGCAGATCCC CACAAGGCTG 2280 

ACTTGGTGTG GCAGCCATGG GAGGACCTAG AGAGCAGCCG GGAGAAGCAG AGGCAAGTGG 2340 

AAGACCTGCT GACAGCCGCC TGCTCCAGCA TTTTCCCTGG TGCTGGCACC AACCAGGAGC 2400 

TGGCCCTGCA CTGTCTGCAC GAATCCAGAG GAGACATCCT GGAAACGCTG AAIAAGCTGC 2460

TGCTGAAGAA GCCCCTGCGG CCCCACAACC ATCCGCTGGC AACTTATCAC TACACAGGCT 2520 

CTGACCAGTG GAAGATGGCC GAGAGGAAGC TGTTCAACAA AGGCATTGCC ATCTACAAGA 2580 

AGGATTTCTT CCTGGTGCAG AAGCTGATCC AGACCAAGAC CGTGGCCCAG TGCGTGGAGT 2640 

TCTACTACAC CTACAAGAAG CAGGTGAAAA TCGGCCGCAA TGGGACTCTA ACCTTTGGGG 2700 

ATGTGGATAC GAGCGATGAG AAGTCGGCCC AGGAAGAGGT TGAAGTGGAT ATTAAGACTT 2760 

CCCAAAAGTT CCCAAGGGTG CCTCTTCCCA GAAGAGAGTC CCCAAGTGAA GAGAGGCTGG 2820 

AGCCCAAGAG GGAGGTGAAG GAGCCCAGGA AGGAGGGGGA GGAGGAGGTG CCAGAGATCC 2880 

AAGAGAAGGA GGAGCAGGAA GAGGGGCGAG AGCGCAGCAG GCGGGCAGCG GCAGTCAAAG 2940 

CCACGCAGAC ACTACAGGCC AATGAGTCGG CCAGTGACAT CCTCATCCTC CGGAGCCACG 3000 

AGTCCAACGC CCCTGGGTCT GCCGGTGGCC AGGCCTCGGA GAAGCCAAGG GAAGGGACAG 3060 

GGAAGTCACG AAGGGCACTA CCTTTTTCAG AAAAAAAAAA AAAAAAACAA AAAGCTT 3117 



(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3306 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:



GAATTCGGCA CGAGGTCAGT TTCCTGTGGA ACACAGAGGC TGCCTGTCCC ATTCAGACAA 60 

CGACGGATAC AGACCAGGCT TGCTCTATAA GGGATCCCAA CAGTGGATTT GTGTTTAATC 120 

TTAATCCGCT AAACAGTTCG CAAGGATATA ACGTCTCTGG CATTGGGAAG ATTTTTATGT 180 

TTAATGTCTG CGGCACAATG CCTGTCTGTG GGACCATCCT GGGAAAACCT GCTTCTGGCT 240 

GTGAGGCAGA AACCCAAACT GAAGAGCTCA AGAATTGGAA GCCAGCAAGG CCAGTCGGAA 300 

TTGAGAAAAG CCTCCAGCTG TCCACAGAGG GCTTCATCAC TCTGACCTAC AAAGGGCCTC 360 

TCTCTGCCAA AGGTACCGCT GATGCTTTTA TCGTCCGCTT TGTTTGCAAT GATGATGTTT 420 

ACTCAGGGCC CCTCAAATTC CTGCATCAAG ATATCGACTC TGGGCAAGGG ATCCGAAACA 480 

CTTACTTTGA GTTTGAAACC GCGTTGGCCT GTGTTCCTTC TCCAGTGGAC TGCCAAGTCA 540 

CCGACCTGGC TGGAAATGAG TACGACCTGA CTGGCCTAAG CACAGTCAGG AAACCTTGGA 600 

CGGCTGTTGA CACCTCTGTC GATGGGAGAA AGAGGACTTT CTATTTGAGC GTTTGCAATC 660 

CTCTCCCTTA CATTCCTGGA TGCCAGGGCA GCGCAGTGGG GTCTTGCTTA GTGTCAGAAG 720 

GCAATAGCTG GAATCTGGGT GTGGTGCAGA TGAGTCCCCA AGCCGCGGCG AATGGATCTT 780 

TGAGCATCAT GTATGTCAAC GGTGACAAGT GTGGGAACCA GCGCTTCTCC ACCAGGATCA 840

CGTTTGAGTG TGCTCAGATA TCGGGCTCAC CAGCATTTCA GCTTCAGGAT GGTTGTGAGT 900 

ACGTGTTTAT CTGGAGAACT GTGGAAGCCT GTCCCGTTGT CAGAGTGGAA GGGGACAACT 960 

GTGAGGTGAA AGACCCAAGG CATGGCAACT TGTATGACCT GAAGCCCCTG GGCCTCAACG 1020 

ACACCATCGT GAGCGCTGGC GAATACACTT ATTACTTCCG GGTCTGTGGG AAGCTTTCCT 1080 

CAGACGTCTG CCCCACAAGT GACAAGTCCA AGGTGGTCTC CTCATGTCAG GAAAAGCGGG 1140 

AACCGCAGGG ATTTCACAAA GTGGCAGGTC TCCTGACTCA GAAGCTAACT TATGAAAATG 1200 

GCTTGTTAAA AATGAACTTC ACGGGGGGGG ACACTTGCCA TAAGGTTTAT CAGCGCTCCA 1260 

CAGCCATCTT CTTCTACTGT GACCGCGGCA CCCAGCGGCC AGTATTTCTA AAGGAGACTT 1320 

CAGATTGTTC CTACTTGTTT GAGTGGCGAA CGCAGTATGC CTGCCCACCT TTCGATCTGA 1380 

CTGAATGTTC ATTCAAAGAT GGGGCTGGCA ACTCCTTCGA CCTCTCGTCC CTGTCAAGGT 1440 

ACAGTGACAA CTGGGAAGCC ATCACTGGGA CGGGGGACCC GGAGCACTAC CTCATCAATG 1500 

TCTGCAAGTC TCTGGCCCCG CAGGCTGGCA CTGAGCCGTG CCCTCCAGAA GCAGCCGCGT 1560 

GTCTGCTGGG TGGCTCCAAG CCCGTGAACC TCGGCAGGGT AAGGGACGGA CCTCAGTGGA 1620 

GAGATGGCAT AATTGTCCTG AAATACGTTG ATGGCGACTT ATGTCCAGAT GGGATTCGGA 1680 

AAAAGTCAAC CACCATCCGA TTCACCTGCA GCGAGAGCCA AGTGAACTCC AGGCCCATGT 1740 

TCATCAGCGC CGTGGAGGAC TGTGAGTACA CCTTTGCCTG GCCCACAGCC ACAGCCTGTC 1800 

CCATGAAGAG CAACGAGCAT GATGACTGCC AGGTCACCAA CCCAAGCACA GGACACCTGT 1860 

TTGATCTGAG CTCCTTAAGT GGCAGGGCGG GATTCACAGC TGCTTACAGC GAGAAGGGGT 1920 

TGGTTTACAT GAGCATCTGT GGGGAGAATG AAAACTGCCC TCCTGGCGTG GGGGCCTGCT 1980 

TTGGACAGAC CAGGATTAGC GTGGGCAAGG CCAACAAGAG GCTGAGATAC GTGGACCAGG 2040 

TCCTGCAGCT GGTGTACAAG GATGGGTCCC CTTGTCCCTC CAAATCCGGC CTGAGCTATA 2100 

AGAGTGTGAT CAGTTTCGTG TGCAGGCCTG AGGCCGGGCC AACCAATAGG CCCATGCTCA 2160 

TCTCCCTGGA CAAGCAGACA TGCACTCTCT TCTTCTCCTG GCACACGCCG CTGGCCTGCG 2220 

AGCAAGCGAC CGAATGTTCC GTGAGGAATG GAAGCTCTAT TGTTGACTTG TCTCCCCTTA 2280 

TTCATCGCAC TGGTGGTTAT GAGGCTTATG ATGAGAGTGA GGATGATGCC TCCGATACCA 2340 

ACCCTGATTT CTACATCAAT ATTTGTCAGC CACTAAATCC CATGCACGGA GTGCCCTGTC 2400 

CTGCCGGAGC CGCTGTGTGC AAAGTTCCTA TTGATGGTCC CCCCATAGAT ATCGGCCGGG 2460 

TAGCAGGACC ACCAATACTC AATCCAATAG CAAATGAGAT TTACTTGAAT TTTGAAAGCA 2520

GTACTCCTTG CCAGGAATTC AGTTGTAAAT AAAATTGAAC CTGCTCAACA GCTGAGGGAG 2580

ACTAGAAATG ATGGGTCCAT ATCCTGGTGC ATTGTCATAC AATTCAAACA ATGGTGCAGC 2640 

TACCAGCTTG TAATTTTTAG GGACTGCAAA CAAGGCTTTT TCTTGAAGCT GAACCAGAAA 2700 

CAACTTCTTA TGTTCCTTAG GCTTTGTAAT ATGTGCAGGA ATATATGGAT ACTGAGGAGG 2760 

TTCAAAATTT GGTCTCCACC AGTTACCAAT GCAATCGTCA ATGACCCAGT CTTGCAAAAC 2820 

TCCATCCTGA CGACCCAGTA TCTCTGTCAT TAAGCGTTTT AGTCCTTCAA CTTCATCTTC 2880 

TCCTGGGTTA AGTTCACCAC CAGGTAGTTT GAAGAAAGTT GTTCCCAGCT GCAGCAGTAA 2940 

CACATGGGGT AGCCGGTGCT CATGTACAAT CAGAACCCCT TCTACAGTCC TCCTCATTCC 3000 

AATTTTATCA AATTCTTCCC TCATGCGCTG AAATCTGGCT GCAACAGAGC TGTCCTTCTC 3060 

GTAGAGGGGC TCTTTTGTAC CAAAAGTATA ATTGGTAAGA GGGTACAGGT TGATGGTGCG 3120 

CTCCAGGGTG AGGGGCTTCG TCTGCTGGAT GTACTTGTTG CCGAACTGAG TGACCCCCCG 3180 

GGGCCAGCCG GTCTGCGAGC GATTGGGCGG TACCACAGAC ATGCTGGCGA GCTCCGGCGC 3240 

TGACGGCGAG CAGAAAGTGG CAGGCAGGGT AGACTTTCCC CGTGCGGGAA GCCTCGTGCC 3300 

GAATTC 3306 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4218 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

GAATTCGGCA CGAGAATGGA TCAACCTCAA CAACACGTTA AAGCTAGACG AAAGAAGTAA 60 

TACACAGTGT ATGAGTCTCA CATGAAATAC CCGGATGTAA ATCCAAAGAA ACAGGAAGCA 120 

GATTGGTGGT TGCCAGGGAC AAGGGCGGTG GGAGGAGAAA ATGGAGAGTA ACGGGACTTT 180 

ACTTTTGGAG TGATGAGAAT GTTTTGGAGC TAGATAGAAG TGGTGGTTGT ACACCATTGT 240 

GGATGTACTA CCACTTAATT GTTCACTTAA AAAGTTAATT TATGTGAATT GCATCTTAAT 300 

TAAAAACAAG GATAACATTC CAACTCCTGG ACATTATCCT TCCTTTCCAT TTGATGTCAG 360 

GCCCGTGTTA GAATTCTCAT CCGGTTTGGT CACTGCACTT AAGATGTGGA GAAATTAGGA 420 

CGCACAGTTA AGAGGAAGGA TAACACTGAT TAAGGTAGTG CTTTTCTAGG TTTCCCCTAA 480 

ACAATTTAAC AGATGGATAG TGGCACCACT TACGAGATGG AAAAACCAGC GGAAGGAAGA 540 

TTTGGGGGAG AAGTTAAGTT TGTCTTGGGC CTGTGTTTTG CAACCTGAGT GTAAAAGACA 600 

TATGTTAAGT CTTCAGTGGC GAAACACTAA AACTAGAAAT GGATCAGAAT TTTATCTTTG 660 

GATGTGACTT CTCAAGGATG GTCTTGTCAC TTCAGTGCCT GGTCAAATGA CAAGATGGGC 720 

AATCTTTTCC TGAAGGTCCA AGCACCTGAA CGTGGCAGGG TGACCCGATT CCGATTTGCT 780 

TAGAACAATC CTAGTTCATG CCTATTGTCC CTCATGTAAT TAATATCACT CTCAAAATGT 840 

CTCATTTTGT GCAATAAATT CTGCAACGTG ATGGCGCGAC TCTCGCGGCC CGAGCGGCCG 900 

GACCTTGTCT TCGAGGAAGA GGACCTCCCC TATGAGGAGG AAATCATGCG GAACCAATTC 960 

TCTGTCAAAT GCTGGCTTCA CTACATCGAG TTCAAACAGG GCGCCCCGAA GCCCAGGCTC 1020 

AATCAGCTAT ACGAGCGGGC ACTCAAGCTG CTGCCCTGCA GCTACAAACT CTGGTACCGA 1080 

TACCTGAAGG CGCGTCGGGC ACAGGTGAAG CATCGCTGTG TGACCGACCC TGCCTATGAA 1140 

GATGTCAACA ACTGTCATGA GAGGGCCTTT GTGTTCATGC ACAAGATGCC TCGTCTGTGG 1200 

CTAGATTACT GCCAGTTCCT CATGGACCAG GGGCGCGTCA CACACACCCG CCGCACCTTC 1260 

GACCGTGCCC TCCGGGCACT GCCCATCACG CAGCACTCTC GAATTTGGCC CCTGTATCTG 1320 

CGCTTCCTGC GCTCACACCC ACTGCCTGAG ACAGCTGTGC GAGGCTATCG GCGCTTCCTC 1380 

AAGCTGAGTC CTGAGAGTGC AGAGGAGTAC ATTGAGTACC TCAAGTCAAG TGACCGGCTG 1440 

GATGAGGCCG CCCAGCGCCT GGCCACCGTG GTGAACGACG AGCGTTTCGT GTCTAAGGCC 1500 

GGCAAGTCCA ACTACCAGCT GTGGCACGAG CTGTGCGACC TCATCTCCCA GAATCCGGAC 1560 

AAGGTACAGT CCCTCAATGT GGACGCCATC ATCCGCGGGG GCCTCACCCG CTTCACCGAC 1620 

CAGCTGGGCA AGCTCTGGTG TTCTCTCGCC GACTACTACA TCCGCAGCGG CCATTTCGAG 1680 

AAGGCTCGGG ACGTGTACGA GGAGGCCATC CGGACAGTGA TGACCGTGCG GGACTTCACA 1740 

CAGGTGTTTG ACAGCTACGC CCAGTTCGAG GAGAGCATGA TCGCTGCAAA GATGGAGACC 1800 

GCCTCGGAGC TGGGGCGCGA GGAGGAGGAT GATGTGGACC TGGAGCTGCG CCTGGCCCGC 1860 

TTCGAGCAGC TCATCAGCCG GCGGCCCCTG CTCCTCAACA GCGTCTTGCT GCGCCAAAAC 1920 

CCACACCACG TGCACGAGTG GCACAAGCGT GTCGCCCTGC ACCAGGGCCG CCCCCGGGAG 1980 

ATCATCAACA CCTACACAGA GGCTGTGCAG ACGGTGGACC CCTTCAAGGC CACAGGCAAG 2040 

CCCCACACTC TGTGGGTGGC GTTTGCCAAG TTTTATGAGG ACAACGGACA GCTGGACGAT 2100 

GCCCGTGTCA TCCTGGAGAA GGCCACCAAG GTGAACTTCA AGCAGGTGGA TGACCTGGCA 2160 

AGCGTGTGGT GTCAGTGCGG AGAGCTGGAG CTCCGACACG AGAACTACGA TGAGGCCTTG 2220 

CGGCTGCTGC GAAAGGCCAC GGCGCTGCCT GCCCGCCGGG CCGAGTACTT TGATGGTTCA 2280 

GAGCCCGTGC AGAACCGCGT GTACAAGTCA CTGAAGGTCT GGTCCATGCT CGCCGACCTG 2340 

GAGGAGAGCC TCGGCACCTT CCAGTCCACC AAGGCCGTGT ACGACCGCAT CCTGGACCTG 2400 

CGTATCGCAA CACCCCAGAT CGTCATCAAC TATGCCATGT TCCTGGAGGA GCACAAGTAC 2460 

TTCGAGGAGA GCTTCAAGGC GTACGAGCGC GGCATCTCGC TGTTCAAGTG GCCCAACGTG 2520 

TCCGACATCT GGAGCACCTA CCTGACCAAA TTCATTGCCC GCTATGGGGG CCGCAAGCTG 2580 

GAGCGGGCAC GGGACCTGTT TGAACAGGCT CTGGACGGCT GCCCCCCAAA ATATGCCAAG 2640 

ACCTTGTACC TGCTGTACGC ACAGCTGGAG GAGGAGTGGG GCCTGGCCCG GCATGCCATG 2700 

GCCGTGTACG AGCGTGCCAC CAGGGCCGTG GAGCCCGCCC AGCAGTATGA CATGTTCAAC 2760 
ATCTACATCA AGCGGGCGGC CGAGATCTAT GGGGTCACCC ACACCCGCGG CATCTACCAG 2820

AAGGCCATTG AGGTGCTGTC GGACGAGCAC GCGCGTGAGA TGTGCCTGCG GTTTGCAGAC 2880 

ATGGAGTGCA AGCTCGGGGA GATTGACCGC GCCCGGGCCA TCTACAGCTT CTGCTCCCAG 2940 

ATCTGTGACC CCCGGACGAC CGGCGCGTTC TGGCAGACGT GGAAGGACTT TGAGGTCCGG 3000 

CATGGCAATG AGGACACCAT CAAGGAAATG CTGCGTATCC GGCGCAGCGT GCAGGCCACG 3060 

TACAACACGC AGGTCAACTT CATGGCCTCG CAGATGCTCA AGGTCTCGGG CAGTGCCACG 3120 

GGCACCGTGT CTGACCTGGC CCCTGGGCAG AGTGGCATGG ACGACATGAA GC1 GCTGGAA 3180 

CAGCGGGCAG AGCAGCTGGC GGCTGAGGCG GAGCGTGACC AGCCCTTGCG CGCCCAGAGC 3240 

AAGATCCTGT TCGTGAGGAG TGACGCCTCC CGGGAGGAGC TGGCAGAGCT GGCACAGCAG 3300 

GTCAACCCCG AGGAGATCCA GCTGGGCGAG GACGAGGACG AGGACGAGAT GGACCTGGAG 3360 

CCCAACGAGG TTCGGCTGGA GCAGCAGAGC GTGCCAGCCG CAGTGTTTGG GAGCCTGAAG 3420 

GAAGACTGAC CCGTCCCCTC GTGCCGAATT CGGCACGAGC AAGACCAGCC CCCAGATCAT 3480 

TTGCCTCAAA GGTTTTCCCT CGAAGTCACA AATGTTTCAA GGAATCTCAA ATTTTACAAA 3540 

GTTTGAAGTG TGGGCATTGG TGGCCTGTGG CTGTGTCCTC TCTCTGTAGC TGTTTTCTCC 3600 

CTACATCCCT GAAAGGAAGT TGAGCCTGCT CCTCCATCCG CAGACCTCCC TTTCCAGCGC 3660 

CCAGGGCATG GGGTGCTGTG AGGGCAGCAT GCTAGGTGTG ACCGTGCTCC TGGCCTCCAG 3720 

GCCCGTGTCC CTCTGTCCTC TAGCCCACTA AGGCCCTGGC CCATTTGTGC TAAACAGGCA 3780

GTCGGACCTA GAAAGAGCAG ACAATCTCTC TGGGTCACCA GTCTGGCTAG GAGCTGGTCT 3840 

CCTGACTGGG ATCCAGGCCT TCTCCCCTGC CCATGTGAAT TCCCAGGGGC AGAGCCTGAA 3900 

ATGTTGAACA CAGCACTGGC CAAAGAGATG TCACCGTGGG AACCGAGGCT CTCTTCTCCT 3960 

CCTGCCTGCT TTCGTGGGTT CAGAGTAGCT GAGGCTTGTC TGAGAGGAGT TGGAGTGCTG 4020 

GTTTTCACCC TGGTTGGTGT GCTTTGCTTT GAGGGCACTT AGAAAGCCCA GCCCAGCCCT 4080 

TGCTCCTGCC CTGCACACAG CGGAGCGACT TTTCTAGGTA TGCTCTTGAT TTCTGCAGAA 4140 

GCAGCAGGTG GCATGGAGCC AAGAGGAAGT GTGACTGAAA CTGTCCACTC ATAGCCCGGC 4200 

TGCCGTATTG AGAGGGCT 4218 

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1187 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

GAGCTCGCGC GCCTGCAGGT CGACACTAGT GGATCCAAAG AATTCGGCAC GAGGGAAACT 60 

CAACGGTGTA CGAGTGGAGG ACAGGGACAG AGCCCTCTGT GGTGGAACGA CCCCACCTCG 120 

AGGAGCTTCC TGAGCAGGTG GCAGAAGATG CGATTGACTG GGGCGACTTT GGGGTAGAGG 180 

CAGTGTCTGA GGGGACTGAC TCTGGCATCT CTGCCGAGGC TGCTGGAATC GACTGGGGCA 240 

TCTTCCCGGA ATCAGATTCA AAGGATCCTG GAGGTGATGG GATAGACTGG GGAGACGATG 300

CTGTTGCTTT GCAGATCACA GTGCTGGAAG CAGGAACCCA GGCTCCAGAA GGTGTTGCCA 360 

GGGGCCCAGA TGCCCTGACA CTGCTTGAAT ACACTGAGAC CCGGAATCAG TTCCTTGATG 420

AGCTCATGGA GCTTGAGATC TTCTTAGCCC AGAGAGCAGT GGAGTTGAGT GAGGAGGCAG 480 

ATGTCCTGTC TGTGAGCCAG TTCCAGCTGG CTCCAGCCAT CCTGCAGGGC CAGACCAAAG 540 

AGAAGATGGT TACCATGGTG TCAGTGCTGG AGGATCTGAT TGGCAAGCTT ACCAGTCTTC 600 

AGCTGCAACA CCTGTTTATG ATCCTGGCCT CACCAAGGTA TGTGGACCGA GTGACTGAAT 660 

TCCTCCAGCA AAAGCTGAAG CAGTCCCAGC TGCTGGCTTT GAAGAAAGAG CTGATGGTGC 720 

AGAAGCAGCA GGAGGCACTT GAGGAGCAGG CGGCTCTGGA GCCTAAGCTG GACCTGCTAC 780 

TGGAGAAGAC CAAGGAGCTG CAGAAGCTGA TTGAAGCTGA CATCTCCAAG AGGTACAGCG 840 

GGCGCCCTGT GAACCTGATG GGAACCTCTC TGTGACACCC TCCGTGTTCT TGCCTGCCCA 900 

TCTTCTCCGC TTTTGGGATG AAGATGATAG CCAGGGCTGT TGTTTTGGGG CCCTTCAAGG 960 

CAAAAGACCA GGCTGACTGG AAGATGGAAA GCCACAGGAA GGAAGCGGCA CCTGATGGTG 1020 

ATCTTGGCAC TCTCCATGTT CTCTACAAGA AGCTGTGGTG ATTGGCCCTG TGGTCTATCA 1080 

GGCGAAAACC ACAGATTCTC CTTCTAGTTA GTATAGCGCA AAAAGCTTCT CGAGAGTACT 1140 

TCTAGAGCGG CCGCGGGCCC ATCGATTTTC CACCCGGGTG GGGTACC 1187 

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3306 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCCTCACTAA AGGGAACAAA AGCTGGAGCT CGCGCGCCTG CAGGTCGACA CTAGTGGATC 60 

GAAAGTTCGT TACGCCAAGC TCGAAATTAA CTCTGGGCTG ACCCATAAAC ATTTGTCTGA 120 



WO 99/58559 






PCT/US99/10793 



TCTAGGATAT AGTTGCGTTT CTTGCGGGCA GCAATCTGGA TGAGGCGGTT GAGGCACTGG 180 

GTGGCCTGCT GGATCAGGAC ATCCCAGCGG CCAGCATAGT TCCGCTGCCG GCGTAGGCCC 240 

ATCACCCGCA TCTTATCCAT GATGGCATTG GTACCCAGGA TGTTGTACTT CTTGGAAGGG 300 

TTGGAGGCTG CATGTTTGAT GGCCCATGTG GTCTTGCCAG CAGCAGGCAG GCCCACCATC 360 

ATCAGAATCT CACATTCTGC CTTGCTCTTT GGTCCAACGG TGCCCCGGAT ACGCTCACTA 420 

AGGGGAAGGT GCTGGATGAA GGTAAACCCC GGGAGGACAG AACAGTAGGG CTCTGCTCTC 480

TGTCCGAAGT TGAACTCCAC TGCGCAATTC TTCACCAGGA CATGAGGATA GAGGGCCTGA 540

CCCCCCAAGG CTTCCTTCTG GATTCGGAAA GCAATGCCCA TCCACTTTCC ATTCTTGGTA 600 

AAAGACAGTT CCACGTCATT TCCACATTCA AAATCCGCAA AGCAGCCAAT CACCGGAGAG 660 

CTCTGCGGTG CTAGGAGAGC GGCTGGGCCC GCAGACTGGG GGGAAAGCTC CGCAGCCGCA 720 

GTGGGCCCCA GGATCAGGCC CCGCGTGGCC TGGAGAAGCC CAGTCTGGGC TGGAGCGGGA 780 

GCTGGACAGT GTGGCCTTGC GTTCGCCCCC GGGAGCGCTG CGAGTGTCGC GGCCTCGGGT 840 

GGATTTGCTG AGCACCAATA CCTCACGGTT GCCAACCTGG GGTTTTAGCT CCCTTGGTTT 900 

TAATCCCCTA GGGGCGGGTG GGGGCACGGG AGGAAGGATG GGCCAGCTGG GTGCAATCCT 960 

GCTGTAAGCC AGCCATTCCT TGATTTCTTA GAATTAACTA AACGGTCGCG CCGGAGGCCG 1020 

CGGGGGCCGG AGCGGAGCAG CCGCGGCTGA GGTTCCCGAG TCGGCCGCTC GGGGCTGCGC 1080 

TCCGCCGCCG GGACCCCGGC CTCTGGCCGC GCCGGCTCCG GCCTCCGGGG GGGCCGGGGC 1140 

CGCCGGGACA TGGTGCCAGT CGCACCCCTT CCCCGCCGCC GCTGAGCTCG CCGGCCGCGC 1200 

CCGGGCTGGG ACGTCCGAGC GGGAAGATGT TTTCCGCCCT GAAGAAGCTG GTGGGGTCGG 1260 

ACCAGGCCCC GGGCCGGGAC AAGAACATCC CCGCCGGGCT GCAGTCCATG AACCAGGCGT 1320 

TGCAGAGGCG CTTCGCCAAG GGGGTGCAGT ACAACATGAA GATAGTGATC CGGGGAGACA 1380 

GGAACACGGG CAAGACAGCG CTGTGGCACC GCCTGCAGGG CCGGCCGTTC GTGGAGGAGT 1440 

ACATCCCCAC ACAGGAGATC CAGGTCACCA GCATCCACTG GAGCTACAAG ACCACGGATG 1500 

ACATCGTGAA GGTTGAAGTC TGGGATGTAG TAGACAAAGG AAAATGCAAA AAGCGAGGCG 1560 

ACGGCTTAAA GATGGAGAAC GACCCCCAGG AGNCGGAGTC TGAAATGGCC CTGGATGCTG 1620 

AGTTCCTGGA CGTGTACAAG AACTGCAACG GGGTGGTCAT GATGTTCGAC ATTACCAAGC 1680 

AGTGGACCTT CAATTACATT CTCCGGGAGC TTCCAAAAGT GCCCACCCAC GTGCCAGTGT 1740 

GCGTGCTGGG GAACTACCGG GACATGGGCG AGCACCGAGT CATCCTGCCG GACGACGTGC 1800 

GTGACTTCAT CGACAACCTG GACAGACCTC CAGGTTCCTC CTACTTCCGC TATGCTGAGT 1860 

CTTCCATGAA GAACAGCTTC GGCCTAAAGT ACCTTCATAA GTTCTTCAAT ATCCCATTTT 1920 

TGCAGCTTCA GAGGGAGACG CTGTTGCGGC AGCTGGAGAC GAACCAGCTG GACATGGACG 1980 

CCACGCTGGA GGAGCTGTCG GTGCAGCAGG AGACGGAGGA CCAGAACTAC GGCATCTTCC 2040 

TGGAGATGAT GGAGGCTCGC AGCCGTGGCC ATGCGTCCCC ACTGGCGGCC AACGGGCAGA 2100 

GCCCATCCCC GGGCTCCCAG TCACCAGTCC TGCCTGCACC CGCTGTGTCC ACGGGGAGCT 2160 

CCAGCCCCGG CACACCCCAG CCCGCCCCAC AGCTGCCCCT CAATGCTGCC CCACCATCCT 2220 

CTGTGCCCCC TGTACCACCC TCAGAGGCCC TGCCCCCACC TGCGTGCCCC TCAGCCCCCG 2280 

CCCCACGGCG CAGCATCATC TCTAGGCTGT TTGGGACGTC ACCTGCCACC GAGGCAGCCC 2340 

CTCCACCTCC AGAGCCAGTC CCGGCCGCAC AGGGCCCAGC AACGGTCCAG AGTGTGGAGG 2400 

ACTTTGTTCC TGACGACCGC CTGGACCGCA GCTTCCTGGA AGACACAACC CCCGCCAGGG 2460 

ACGAGAAGAA GGTGGGGGCC AAGGCTGCCC AGCAGGACAG TGACAGTGAT GGGGAGGCCC 2520 

TGGGCGGCAA CCCGATGGTG GCAGGGTTCC AGGACGATGT GGACCTCGAA GACCAGCCAC 2580 

GTGGGAGTCC CCCGCTGCCT GCAGGCCCCG TCCCCAGTCA AGACATCACT CTTTCGAGTG 2640 

AGGAGGAAGC AGAAGTGGCA GCTCCCACAA AAGGCCCTGC CCCAGCTCCC CAGCAGTGCT 2700 

CAGAGCCAGA GACCAAGTGG TCCTCCATAC CAGCTTCGAA GCCACGGAGG GGGACAGCTC 2760 

CCACGAGGAC CGCAGCACCC CCCTGGCCAG GCGGTGTCTC TGTTCGCACA GGTCCGGAGA 2820 

AGCGCAGCAG CACCAGGCCC CCTGCTGAGA TGGAGCCGGG GAAGGGTGAG CAGGCCTCCT 2880 

CGTCGGAGAG TGACCCCGAG GGACCCATTG CTGCACAAAT GCTGTCCTTC GTCATGGATG 2940 

ACCCCGACTT TGAGAGCGAG GGATCAGACA CACAGCGCAG GGCGGATGAC TTTCCCGTGC 3000 

GAGATGACCC CTCCGATGTG ACTGACGAGG ATGAGGGCCC TGCCGAGCCG CCCCCACCCC 3060 

CCAAGCTCCC TCTCCCCGCC TTCAGACTGA AGAATGACTC GGACCTCTTC GGGCTGGGGC 3120 

TGGAGGAGGC CGGACCCAAG GAGAGCAGTG AGGAAGGTAA GGAGGGCAAA ACCCCCTCTA 3180 

AGGAGAAGAA AAAAAAAACA AAAAGCTTCT CGAGAGTACT TCTAGAGCGG CCGCGGGCCC 3240 

ATCGATTTTC CACCCGGGTG GGGTACCAGG TAAGTGTACC CAATTCGCCC TATAGTGAGT 3300 

CGTATT 3306 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
TGCGGGGCCA GAGTGGGCTG 20 
(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
GCAGTCCTGG CCTGCGGATG 20



(2) INFORMATION FOR SEQ ID NO:13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTCGACAGGA GAATTGGTTC 20 
(2) INFORMATION FOR SEQ ID NO:14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

GCCTGGGTTC GGTGCGGGAC 20 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
TGGTCGGGTG TTTGTGAGTG 20
(2) INFORMATION FOR SEQ ID NO:16:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CCTCTTCCGT CTCCTCAGTG 20 
(2) INFORMATION FOR SEQ ID NO:17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
GGATTGCTAG TCTCACAGAC 20
(2) INFORMATION FOR SEQ ID NO:18:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
TTAAGGGTGG CTGAAGGGAC 20 
(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ACCTTCCCTC CCTGTCACAG 20 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
TGGTCGGGTG TTTGTGAGTG 20 
(2) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

ACACCATTCC AGAAATTCAG 20 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:

AAACTGCAGG TGGCTGAGTC 20 

(2) INFORMATION FOR SEQ ID NO:23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
GTCCTAATGT TTTCAGGGAG 20
(2) INFORMATION FOR SEQ ID NO:24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
AAAACCTATG GTTACAATTC 20 
(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCCTAGACAT GGTTCAAGTG 20 
(2) INFORMATION FOR SEQ ID NO:26:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GATATAATTA GTTCTCCATC 20 
(2) INFORMATION FOR SEQ ID NO:27:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
ATGCCTGTTC CAGGCTGCAC 20 
(2) INFORMATION FOR SEQ ID NO:28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:
GGACGGCGAC CTCCACCCAC 20 
(2) INFORMATION FOR SEQ ID NO:29: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:
GGGCTCCTCC GACGCCTGAG 20 
(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:
AGTCTAGCCC TGGCCTTGAC 20 
(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
GTCACTGGGG ACTCCGGCAG 20 
(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
CAGCTTTCCC TGGGCACATG 20 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
CACAGCTGTC TCAAGCCCAG 20 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:
ACTGTTCCCC CTACATGATG 20



(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
ATCATATCCT CTTGCTGGTC 20 
(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
GTTCCCAGAG CTTGTCTGTG 20 
(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
GTTTGGCAGA CTCATAGTTG 20 
(2) INFORMATION FOR SEQ ID NO:38:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
TAGCAGGGAG CCATGACCTG 20 
(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:
CTTGGCGCCA GAAGCGAGAG 20 
(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:
CCTCTCTCTC TCTCTCTCTC 20



(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:
TCCCCGCTGA TTCCGCCAAG 20 
(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:
CTTTTTGAAT TCGGCACGAG 20 
(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
CCCCTGGTCC GCACCAGTTC 20 
(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:
GAGAAGGGTC GGGGCGGCAG 20 
(2) INFORMATION FOR SEQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
AAATCACATC GCGTCAACAC 20 



(2) INFORMATION FOR SEQ ID NO: 46: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TAAGAGAGTC ATAGTTACTC 20 
(2) INFORMATION FOR SEQ ID NO:47: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
GCTCTAGAAG TACTCTCGAG 20

(2) INFORMATION FOR SEQ ID NO:48:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:
ACTCTGGCCA TCAGGAGATC 20

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
CAGGCGTTGT AGATGTTCTG 20 
(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:
AGTGGCAGGC AGAAGTAATG 20

(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:
GGTTGGAGAA CTGGATGTAG 20
(2) INFORMATION FOR SEQ ID NO:52:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
CTATTCAGAT GCAACGCCAG 20 
(2) INFORMATION FOR SEQ ID NO:53:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
CCATGGCACA CAGAGCAGAC 20 
(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 
GCTACCATGC AGAGACACAG 20 
(2) INFORMATION FOR SEQ ID NO:55:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:
CAGGCTGACA AGAAAATCAG 20
(2) INFORMATION FOR SEQ ID NO:56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

GGCACGCATA GAGGAGAGAC 20 

(2) INFORMATION FOR SEQ ID NO:57:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:
TGGGTGATGC CTTTGCTGAC 20



(2) INFORMATION FOR SEQ ID NO:58:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
AAAACAAGAT CAAGGTGATG 20 
(2) INFORMATION FOR SEQ ID NO:59:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:
TTGCCCACAT TGCTATGGTG 20 
(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:
GACCAAGATC AGAAGTAGAG 20 
(2) INFORMATION FOR SEQ ID NO:61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:
CCCCTGGGCC AATGATGTTG 20 
(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:
TCTTCCCACC ATAGCAATG 19
(2) INFORMATION FOR SEQ ID NO:63:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:

TGGTCTTGGT GACCAATGTG 20

(2) INFORMATION FOR SEQ ID NO:64:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:
ACACCTCGGT GACCCCTGTG 20
(2) INFORMATION FOR SEQ ID NO:65:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:
TCTCCAAGTT CGGCACAGTG 20 
(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:
ACATGGGCTG CACTCACGAC 20
(2) INFORMATION FOR SEQ ID NO:67:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:
GATCCTCTGA ACCTGCAGAG 20
(2) INFORMATION FOR SEQ ID NO:68:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear








(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 



GGAAATGAGG TGGGGCGATC 20
(2) INFORMATION FOR SEQ ID NO:69:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 
CTTTGCCTTG GACAAGGATG 20 
(2) INFORMATION FOR SEQ ID NO:70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GCACCTGCCA TTGGGGGTAG 20 
(2) INFORMATION FOR SEQ ID NO:71:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:
GGTGGAAGCC ATTGACGGTG 20
(2) INFORMATION FOR SEQ ID NO:72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:
TGCGTCTCTC GTCGCTGCTG 20 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GCGGAAACTC TGTGGTGCTG 20 



(2) INFORMATION FOR SEQ ID NO:74:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:
AGGATTGCCT TCCTCTACTG 20
(2) INFORMATION FOR SEQ ID NO:75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
TGTCTGTTTC ACCAGGGCAG 20 
(2) INFORMATION FOR SEQ ID NO:76:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:
CCAGTGCCTC TATGCATGTC 20
(2) INFORMATION FOR SEQ ID NO:77:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:
AGGAAGCCCA CGCACACCAC 20 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:

CCCTTTGTTC CCTGATCTTC 20 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:
CGCTCGGGAT CCAGGTCATC 20



(2) INFORMATION FOR SEQ ID NO:80: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
TCGAGGTTCA GAGCGTAGTG 20 
(2) INFORMATION FOR SEQ ID NO:81:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 

TCTTGGATCT CTGGCACCTC 20 

(2) INFORMATION FOR SEQ ID NO:82: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 
CCATCAGAGT GAAGGAGGAG 20 
(2) INFORMATION FOR SEQ ID NO:83:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CCATCTTCCA CTGGTCAGAG 20 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:
CTCCTTCTCT TGGATCTCTG 20 



(2) INFORMATION FOR SEQ ID NO:85: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:
TTACTTCAGC ACTGTTAGTC 20



(2) INFORMATION FOR SEQ ID NO:86:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 
AGGGAGGTAG CTCAAAGCTC 20 
(2) INFORMATION FOR SEQ ID NO:87:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87:
TGGGTCCACA GTTCGCACAG 20 
(2) INFORMATION FOR SEQ ID NO:88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 

CAACTCTGTG ATGGCTCCAG 20 

(2) INFORMATION FOR SEQ ID NO:89:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
AGCAGGGTTC TGTTCAAGAC 20 
(2) INFORMATION FOR SEQ ID NO:90:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 
CCATTGGGTG CTAGTCTCTC 20
(2) INFORMATION FOR SEQ ID NO:91:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91:
CAGCCATGCT GTCCCAGCAG 20
(2) INFORMATION FOR SEQ ID NO:92:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 
CTGGACCTGA GGTAGCGCTG 20 
(2) INFORMATION FOR SEQ ID NO:93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93:

ATAACCACCC TGAGGCACTG 20 

(2) INFORMATION FOR SEQ ID NO:94: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 
CCTGCAGGTC GACACTAGTG 20 
(2) INFORMATION FOR SEQ ID NO:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 
AATTGGAATG AGGAGGACTG 20 
(2) INFORMATION FOR SEQ ID NO:96:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96:
GCTCTAGAAG TACTCTCGAG 20
(2) INFORMATION FOR SEQ ID NO:97:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 
ATTGTATGAC AATGCACCAG 20 
(2) INFORMATION FOR SEQ ID NO:98:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 

TCCACAGAGG GCTTCATCAC 20 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99:
CCTGACTGGC CTAAGCACAG 20
(2) INFORMATION FOR SEQ ID NO:100:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100:
AAGCCTCATA ACCACCAGTG 20
(2) INFORMATION FOR SEQ ID NO:101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101:
TGTCAACGGT GACAAGTGTG 20
(2) INFORMATION FOR SEQ ID NO:102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102:
TTGTACACCA GCTGCAGGTC 20
(2) INFORMATION FOR SEQ ID NO:103:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 
GGGTGTGGTG CAGATGAGTC 20 
(2) INFORMATION FOR SEQ ID NO:104:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104:
ATCACACTCT TATAGCTCAG 20
(2) INFORMATION FOR SEQ ID NO:105:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GTGGGAAGCT TTCCTCAGAC 20 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
TGATGAACAT GGGCCTGGAG 20 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107:
CATTGTGGAT GTACTACCAC 20
(2) INFORMATION FOR SEQ ID NO:108:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
TGTGTTTTGC AACCTGAGTG 20 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
ATAGTGGCAC CACTTACGAG 20 
(2) INFORMATION FOR SEQ ID NO:110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:110: 
AATTCTGCAA CGTGATGGCG 20 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:111: 
CACAAGATGC CTCGTCTGTG 20 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:112:

AATCCGGACA AGGTACAGTC 20 

(2) INFORMATION FOR SEQ ID NO:113: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113:
GCACGAGTGG CACAAGCGTG 20



(2) INFORMATION FOR SEQ ID NO:114: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:114:
GCAAGCGTGT GGTGTCAGTG 20 
(2) INFORMATION FOR SEQ ID NO:115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:115:
TGTTTGAACA GGCTCTGGAC 20
(2) INFORMATION FOR SEQ ID NO:116:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
CGGCATGGCA ATGAGGACAC 20 
(2) INFORMATION FOR SEQ ID NO:117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:117: 
AGGACGAGAT GGACCTCCAG 20 
(2) INFORMATION FOR SEQ ID NO:118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118:
CCCTCTGTCC TCTAGCCCAC 20 



(2) INFORMATION FOR SEQ ID NO:119:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:119:
TCTTGAGGGG ACTGACTCTG 20 
(2) INFORMATION FOR SEQ ID NO:120:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:120:
TGAGTGAGGA GGCAGATGTC 20 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:121: 
TGGCTTTGAA GAAAGAGCTG 20 
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
GCAAAAGACC AGGCTGACTG 20 
(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 
TGCAGCTCCT TGGTCTTCTC 20 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124:
GATTCACAGT CCCAAGGCTC 20



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 
ATCTGGATGA GGCGGTTGAG 20 
(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 
GGTCACTCTC CGACGAGGAG 20 
(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127:

GGATCCAAAG TTCGTCTCTG 20 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CGCTGTGTGT CTGATCCCTC 20 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

ATGAAGGTAA ACCCCGGGAG 20 

(2) INFORMATION FOR SEQ ID NO:130: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130:
TGGTCTCTGG CTCTGAGCAC 20



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:131: 
GCCTGGAGAA GCCCAGTCTG 20 
(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: 
CACACTCTGG ACCGTTGCTG 20 
(2) INFORMATION FOR SEQ ID NO:133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133:
AAAGCTCCGC AGCCGCAGTG 20 
(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
TCTTCCAGGA AGCTGCGGTC 20 
(2) INFORMATION FOR SEQ ID NO:135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:135:
GATGGTGGGG CAGCATTGAG 20
(2) INFORMATION FOR SEQ ID NO:136:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:136:
GTCACCAGTG GTGCCTGCAG 20

(2) INFORMATION FOR SEQ ID NO:137:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
ACCTCACGGT TGCCAACCTG 20 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
CGCAACAGCG TCTCCCTCTG 20 
(2) INFORMATION FOR SEQ ID NO:139:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
AGTACCTTCA TAAGTTCTTC 20 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:140:
TCCCAGACTT CAACCTTCAC 20
(2) INFORMATION FOR SEQ ID NO:141:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:141:
AAACATCTTC CCGGTCGGAC 20

(2) INFORMATION FOR SEQ ID NO:142:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:142:

GCTGAGCACC TTTACCTCAC 20

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:143:
GACGTCCGTC CGGGAAGATG 20 
(2) INFORMATION FOR SEQ ID NO:144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:144: 
ACACAGGAGA TGCAGGTCAC 20

(2) INFORMATION FOR SEQ ID NO:145:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
GAGTCTTCCA TGAAGAACAG 20 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
GCAGTGAGGA AGGTAAGGAG 20 
(2) INFORMATION FOR SEQ ID NO:147:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 378...1799
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GGATCCAAAG GACGCCCCCG CCGACAGGAG AATTGGTTCC CGGGCCCGCG GCGATGCCCC 60 

CCCGGTAGCT CGGGCCCGTG GTCGGGTGTT TGTGAGTGTT TCTATGTGGG AGAAGGAGGA 120 

GGAGGAGGAA GAAGAAGCAA CGATTTGTCT TCTCGGCTGG TCTCCCCCCG GCTCTACATG 180 

TTCCCCGCAC TGAGGAGACG GAAGAGGAGC CGTAGCCGCC CCCCCTCCCG GCCCGGATTA 240 

TAGTCTCTCG CCACAGCGGC CTCGGCCTCC CCTTGGATTC AGACGCCGAT TCGCCCAGTG 300 

TTTGGGAAAT GGGAAGTAAT GACAGCTGGC ACCTGAACTA AGTACTTTTA TAGGCAACAC 360 

CATTCCAGAA ATTCAGG ATG AAT GGG GAT ATG CCC CAT GTC CCC ATT ACT 410
Met Asn Gly Asp Met Pro His Val Pro Ile Thr
1 5 10

ACT CTT GCG GGG ATT GCT AGT CTC ACA GAC CTC CTG AAC CAG CTG CCT 458
Thr Leu Ala Gly Ile Ala Ser Leu Thr Asp Leu Leu Asn Gln Leu Pro
15 20 25

CTT CCA TCT CCT TTA CCT GCT ACA ACT ACA AAG AGC CTT CTC TTT AAT 506
Leu Pro Ser Pro Leu Pro Ala Thr Thr Thr Lys Ser Leu Leu Phe Asn
30 35 40

GCA CGA ATA GCA GAA GAG GTG AAC TGC CTT TTG GCT TGT AGG GAT GAC 554
Ala Arg Ile Ala Glu Glu Val Asn Cys Leu Leu Ala Cys Arg Asp Asp
45 50 55

AAT TTG GTT TCA CAG CTT GTC CAT AGC CTC AAC CAG GTA TCA ACA GAT 602
Asn Leu Val Ser Gln Leu Val His Ser Leu Asn Gln Val Ser Thr Asp
60 65 70 75

CAC ATA GAG TTG AAA GAT AAC CTT GGC AGT GAT GAC CCA GAA GGT GAC 650
His Ile Glu Leu Lys Asp Asn Leu Gly Ser Asp Asp Pro Glu Gly Asp
80 85 90

ATA CCA GTC TTG TTG CAG GCC GTC CTG GCA AGG AGT CCT AAT GTT TTC 698
Ile Pro Val Leu Leu Gln Ala Val Leu Ala Arg Ser Pro Asn Val Phe
95 100 105

AGG GAG AAA AGC ATG CAG AAC AGA TAT GTA CAA AGT GGA ATG ATG ATG 746
Arg Glu Lys Ser Met Gln Asn Arg Tyr Val Gln Ser Gly Met Met Met
110 115 120

TCT CAG TAT AAA CTT TCT CAG AAT TCC ATG CAC AGT AGT CCT GCA TCT 794
Ser Gln Tyr Lys Leu Ser Gln Asn Ser Met His Ser Ser Pro Ala Ser
125 130 135

TCC AAT TAT CAA CAA ACC ACT ATC TCA CAT AGC CCC TCC AGC CGG TTT 842
Ser Asn Tyr Gln Gln Thr Thr Ile Ser His Ser Pro Ser Ser Arg Phe
140 145 150 155

GTG CCA CCA CAG ACA AGC TCT GGG AAC AGA TTT ATG CCA CAG CAA AAT 890
Val Pro Pro Gln Thr Ser Ser Gly Asn Arg Phe Met Pro Gln Gln Asn
160 165 170

AGC CCA GTG CCT AGT CCA TAC GCC CCA CAA AGC CCT GCA GGA TAC ATG 938
Ser Pro Val Pro Ser Pro Tyr Ala Pro Gln Ser Pro Ala Gly Tyr Met
175 180 185

CCA TAT TCC CAT CCT TCA AGT TAC ACA ACA CAT CCA CAG ATG CAA CAA 986
Pro Tyr Ser His Pro Ser Ser Tyr Thr Thr His Pro Gln Met Gln Gln
190 195 200

GCA TCG GTA TCA ACT CCC ATT GTT GCA GGT GGT TTG AGA AAC ATA CAT 1034
Ala Ser Val Ser Ser Pro Ile Val Ala Gly Gly Leu Arg Asn Ile His
205 210 215

GAT AAT AAA GTT TCT GGT CCG TTG TCT GGC AAT TCA GCT AAT CAT CAT 1082
Asp Asn Lys Val Ser Gly Pro Leu Ser Gly Asn Ser Ala Asn His His
220 225 230 235

GCT GAT AAT CCT AGA CAT GGT TCA ACT GAG GAC TAC CTA CAC ATG GTG 1130
Ala Asp Asn Pro Arg His Gly Ser Ser Glu Asp Tyr Leu His Met Val
240 245 250

CAC AGG CTA ACT ACT GAC GAT GGA GAT TCT TCA ACA ATG AGG AAT GCT 1178
His Arg Leu Ser Ser Asp Asp Gly Asp Ser Ser Thr Met Arg Asn Ala
255 260 265

GCA TCT TTT CCC TTG AGA TCT CCA CAG CCA GTA TGC TCC CCT GCT GGA 1226
Ala Ser Phe Pro Leu Arg Ser Pro Gln Pro Val Cys Ser Pro Ala Gly
270 275 280

ACT GAA GGA ACT CCT AAA GGC TCA AGA CCA CCT TTA ATC CTA CAA TCT 1274
Ser Glu Gly Thr Pro Lys Gly Ser Arg Pro Pro Leu Ile Leu Gln Ser
285 290 295

CAG TCT CTA CCT TGT TCA TCA CCT CGA GAT GTT CCA CCA GAT ATC TTG 1322
Gln Ser Leu Pro Cys Ser Ser Pro Arg Asp Val Pro Pro Asp Ile Leu
300 305 310 315

CTA GAT TCT CCA GAA AGA AAA CAA AAG AAG CAG AAG AAA ATG AAA TTA 1370
Leu Asp Ser Pro Glu Arg Lys Gln Lys Lys Gln Lys Lys Met Lys Leu
320 325 330

GGC AAG GAT GAA AAA GAG CAG AGT GAG AAA GCG GCA ATG TAT GAT ATA 1418
Gly Lys Asp Glu Lys Glu Gln Ser Glu Lys Ala Ala Met Tyr Asp Ile
335 340 345

ATT AGT TCT CCA TCC AAG GAC TCT ACT AAA CTT ACA TTA AGA CTT TCT 1466
Ile Ser Ser Pro Ser Lys Asp Ser Thr Lys Leu Thr Leu Arg Leu Ser
350 355 360

CGT GTA AGG TCT TCA GAC ATG GAC CAG CAA GAG GAT ATG ATT TCT GGT 1514
Arg Val Arg Ser Ser Asp Met Asp Gln Gln Glu Asp Met Ile Ser Gly
365 370 375

GTG GAA AAT AGC AAT GTT TCA GAA AAT GAT ATT CCT TTT AAT GTG CAG 1562
Val Glu Asn Ser Asn Val Ser Glu Asn Asp Ile Pro Phe Asn Val Gln
380 385 390 395

TAC CCA GGA CAG ACT TCA AAA ACA CCC ATT ACT CCA CAA GAT ATA AAC 1610
Tyr Pro Gly Gln Thr Ser Lys Thr Pro Ile Thr Pro Gln Asp Ile Asn
400 405 410

CGC CCA CTA AAT GCT GCT CAA TGT TTG TCG CAG CAA GAA CAA ACA GCA 1658
Arg Pro Leu Asn Ala Ala Gln Cys Leu Ser Gln Gln Glu Gln Thr Ala
415 420 425

TTC CTT CCA GCA AAT CAA GTG CCT GTT TTA CAA CAG AAC ACT TCA GTT 1706
Phe Leu Pro Ala Asn Gln Val Pro Val Leu Gln Gln Asn Thr Ser Val
430 435 440

GCT GCA AAA CAA CCC CAG ACC AAT AGT CAC AAA ACC TTG GTG CAG CCT 1754
Ala Ala Lys Gln Pro Gln Thr Asn Ser His Lys Thr Leu Val Gln Pro
445 450 455

GGA ACA GGC ATA GAG GTC TCA GCA GAG CTG CCC AAG GAC AAG ACC TAAGA 1804
Gly Thr Gly Ile Glu Val Ser Ala Glu Leu Pro Lys Asp Lys Thr
460 465 470

TCCAGCAGGG AACTATGTAG TCACCCCGAG AGGCCCAGCT CTCTCCGTGA GCTCTGGGCC 1864 

TAGGGTGGGG GTGGTTGTTG GTTCTGCGCG CACTGTTCCC CCTACATGAT GGGTCCATCC 1924

CAGTTGGCTT CTCTCACTCG CTTCCTCCTG TGGAGAAGCC TGTCCAGGTG TCACTGCCTC 1984 

CAGGAAGCTG TCTCTGATTT CTCCAGTTGA ACAGTGAGAT TTGCCACACX TCACATGCAT 2044

CGCTCTTGTC CCTGGAATTG TAACCATAGG TTTTCCTGTC TCCTGGAGGA CAAGGATGAG 2104

GGCTTTCCAC TTGAGTCTCC CTGGTGGAGC CCAGCTCCTG ACATACCTGG TAAAAGTTCT 2164

CAAGAGAAGA ACATGGAGGA GGAATGTGGA TAACAACCCT GGCTGCCTGT GTGTTCCAAG 2224

CTAGGAAGAT GTAATGTCCC CACAAACGGG GTAAATGGCT TGCCTGCGTC ACAGCTGTCT 2284 

CAAGCCCAGG CCCTGGGCGC CAGCCCAAGC CCAAGGACTA GGTCCAGAGC CACACAGCGC 2344 

C^GCCACA? CCGCCTCACC TGGGACCCTT TGTGGGGTAC AGTCTCCGGC CCCACCCAGA 2404 

CCTCCTGAAG GAGAGACCCC ATGGCAAGGA CTCAfCCACC TGCAGTTTCA ^AGCCCCCn 2464 

GTGGGTTCCT AGGCATGAAG ACCACCGGTT AGAGGCTGAA CTGGCAGGAA CCTGTCTCCA 2524

GCCCCTTCTC ACCCCAGCCG GGCCCTGCCT CAGAGGCAGC ACCCAGGACG TGGCCATGAC 2584 

ccgStc CACTCAATCC CTCTTCTCCA GGAGCCATGC AAAGTGTCAG CCAGCCAGGC 2644

CCCTGGAAGG CAGTCATCAC CTCTTAAGGC ATTGTGGGTG TCGGTCCTGC AACTGCCAGG 2704 

TGCAGCACAC GACCCGTGTC CGGTGTTCGA TAGCAGGGAG CCATGACCTG GCAACGATTC 2764 

CACGCTCAAA GGGGCACCCG GGGGGCCCTG GGTCGGGGCG GATCAGCTTT CCCTGGGCAC 2824 

ATC?GCC^ TTCCAGATCT CCAGGGCTCA TGTCTGTGAC AGGGAGGGAA GGCTCTGCCC 2884 

TGGCCTTCCG TCAGCTCTGC CAGTGCAGGC TGGGCAGCCT GGGCTTTAGA GCTGGCTTCT 2944 

GCCCACACTT TCTCCGTGAA AGGAAAACAA CTATGAGTCT GCCAAACGCA TCTCAGATGC 3004

GTTTTAAAAA ATTCTGGTCC CCGCTCTCTG TCCCATCATC CGCCTCGGGG ACTTCCTCTC 3064 

TCCGTGGTTC TCACCCCATA CTCTGTCACT GCCACATTTT CACCTGGGCC TGGCCTTTGT 3124

CTCCACCTGA AACTCCTGAA AATCTTGAAA TGGATTTCTA GGTCACTGGG GACTCCGGCA 3184 

GCACATTCGG CTTCAGAATA AAGGGCGCCC GCGGTCCCCC AGCACCTCCC CAAGCCACAC 3244 

CCCTAGCTTC CCTCCCTATC CCTGCAGCCT GAGGGTCCCT TCAGCCACCC TTAAGTCCCC 3304 

ACCTGGGCTC CTGCCCCGCC CCTGGCTAGC AGCGCCTTCT CCACCGGGGC CCCCTCTGCT 3364 

CACAGAGCCC CCTCACCTCC CTGGGGATGA GGGGCCAGGC CATGACCCTG AAAGTCTAGC 3424 

CC^CTTG ACCTCCCAGG AGCGCCCTCC CCGCCCTCTC CCGGCCCCGG CCCCGTCCTC 3484 

TGCTGCTGGC CTCTGGGTCG TGCCCCGCAG ACTGAGCTGC GCTTGGGGGT CCTGGCGGCC 3544 

TGGGCCGTCC CGCACCGAAC CCAGGCGGTC GGAGCCCGGC GGGAAGGCGC GAGGTCCTTC 3604 

TGGGGGCTCC TCCGACGCCT GAGGGCGCTG CTTCCCCGCG GCCGCCCCGG GTTTCTGCGG 3664 

AGCCGGGGCC TCCGCTCTCG GGTGACCCGG TGAGACCCCC GGGGAGGCCG CTGGGGAGGC 3724 

GCGGGCTCTG CTCCCGGGTC CCAAACGCAC TGGCTGCCCC TCAGGAGGGA CGGCGACCTC 3784 

CACCCACGGC GCTGGCGCCC GCACGGCCGC TCCTCCCGCT CCCGCAGCCT GGACGCCTCC 3844 

CGAGGCCGCC CCGCCGGGCC CCACGCGCGG CCCCATCCGC AGGCCAGGAC TGCCTTCCCG 3904 

GAGCTGGCGG CCCCCAGCCT GGAGGAGCCG GCCCCAGACG CCCTCCCAGC CCTCCCCAGC 3964 

CCACTCTGGC CCCGCAGCCC CCGCCTGGTC CGAGTGCGGG TCTCTGGCCC CGGCCTTTCC 4024 
CGGGGAAGGA AAGCAAAAAG CTT 4047

(2) INFORMATION FOR SEQ ID NO:148:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 474 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:148: 

Met Asn Gly Asp Met Pro His Val Pro Ile Thr Thr Leu Ala Gly Ile
1 5 10 15

Ala Ser Leu Thr Asp Leu Leu Asn Gln Leu Pro Leu Pro Ser Pro Leu
20 25 30

Pro Ala Thr Thr Thr Lys Ser Leu Leu Phe Asn Ala Arg Ile Ala Glu
35 40 45

Glu Val Asn Cys Leu Leu Ala Cys Arg Asp Asp Asn Leu Val Ser Gln
50 55 60

Leu Val His Ser Leu Asn Gln Val Ser Thr Asp His Ile Glu Leu Lys
65 70 75 80

Asp Asn Leu Gly Ser Asp Asp Pro Glu Gly Asp Ile Pro Val Leu Leu
85 90 95

Gln Ala Val Leu Ala Arg Ser Pro Asn Val Phe Arg Glu Lys Ser Met
100 105 110

Gln Asn Arg Tyr Val Gln Ser Gly Met Met Met Ser Gln Tyr Lys Leu
115 120 125

Ser Gln Asn Ser Met His Ser Ser Pro Ala Ser Ser Asn Tyr Gln Gln
130 135 140

Thr Thr Ile Ser His Ser Pro Ser Ser Arg Phe Val Pro Pro Gln Thr
145 150 155 160

Ser Ser Gly Asn Arg Phe Met Pro Gln Gln Asn Ser Pro Val Pro Ser
165 170 175

Pro Tyr Ala Pro Gln Ser Pro Ala Gly Tyr Met Pro Tyr Ser His Pro
180 185 190

Ser Ser Tyr Thr Thr His Pro Gln Met Gln Gln Ala Ser Val Ser Ser
195 200 205

Pro Ile Val Ala Gly Gly Leu Arg Asn Ile His Asp Asn Lys Val Ser
210 215 220

Gly Pro Leu Ser Gly Asn Ser Ala Asn His His Ala Asp Asn Pro Arg
225 230 235 240

His Gly Ser Ser Glu Asp Tyr Leu His Met Val His Arg Leu Ser Ser
245 250 255

Asp Asp Gly Asp Ser Ser Thr Met Arg Asn Ala Ala Ser Phe Pro Leu
260 265 270

Arg Ser Pro Gln Pro Val Cys Ser Pro Ala Gly Ser Glu Gly Thr Pro
275 280 285

Lys Gly Ser Arg Pro Pro Leu Ile Leu Gln Ser Gln Ser Leu Pro Cys
290 295 300

Ser Ser Pro Arg Asp Val Pro Pro Asp Ile Leu Leu Asp Ser Pro Glu
305 310 315 320

Arg Lys Gln Lys Lys Gln Lys Lys Met Lys Leu Gly Lys Asp Glu Lys
325 330 335

Glu Gln Ser Glu Lys Ala Ala Met Tyr Asp Ile Ile Ser Ser Pro Ser
340 345 350

Lys Asp Ser Thr Lys Leu Thr Leu Arg Leu Ser Arg Val Arg Ser Ser
355 360 365

Asp Met Asp Gln Gln Glu Asp Met Ile Ser Gly Val Glu Asn Ser Asn
370 375 380

Val Ser Glu Asn Asp Ile Pro Phe Asn Val Gln Tyr Pro Gly Gln Thr
385 390 395 400

Ser Lys Thr Pro Ile Thr Pro Gln Asp Ile Asn Arg Pro Leu Asn Ala
405 410 415

Ala Gln Cys Leu Ser Gln Gln Glu Gln Thr Ala Phe Leu Pro Ala Asn
420 425 430

Gln Val Pro Val Leu Gln Gln Asn Thr Ser Val Ala Ala Lys Gln Pro
435 440 445

Gln Thr Asn Ser His Lys Thr Leu Val Gln Pro Gly Thr Gly Ile Glu
450 455 460

Val Ser Ala Glu Leu Pro Lys Asp Lys Thr
465 470

(2) INFORMATION FOR SEQ ID NO:149:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 26...799
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:149:

AAGCTTTTTG AATTCGGCAC GAGAT GCT ACA CAG GCT ATA TTT GAA ATA CTG 52
Ala Thr Gln Ala Ile Phe Glu Ile Leu
1 5

GAG AAA TCC TGG TTG CCC CAG AAT TGT ACA CTG GTT GAT ATG AAG ATT 100
Glu Lys Ser Trp Leu Pro Gln Asn Cys Thr Leu Val Asp Met Lys Ile
10 15 20 25

GAA TTT GGT GTT GAT GTA ACC ACC AAA GAA ATT GTT CTT GCT GAT GTT 148
Glu Phe Gly Val Asp Val Thr Thr Lys Glu Ile Val Leu Ala Asp Val
30 35 40

ATT GAC AAT GAT TCC TGG AGA CTC TGG CCA TCA GGA GAT CGA AGC CAA 196
Ile Asp Asn Asp Ser Trp Arg Leu Trp Pro Ser Gly Asp Arg Ser Gln
45 50 55

CAG AAA GAC AAA CAG TCT TAT CGG GAC CTC AAA GAA GTA ACT CCT GAA 244
Gln Lys Asp Lys Gln Ser Tyr Arg Asp Leu Lys Glu Val Thr Pro Glu
60 65 70



GGG CTC CAA ATG GTA AAG AAA AAC TTT GAG TGG GTT GCA GAG AGA GTA 292
Gly Leu Gln Met Val Lys Lys Asn Phe Glu Trp Val Ala Glu Arg Val
75 80 85

GAG TTG CTT TTG AAA TCA GAA AGT CAG TGC AGG GTT GTA GTG TTG ATG 340
Glu Leu Leu Leu Lys Ser Glu Ser Gln Cys Arg Val Val Val Leu Met
90 95 100 105

GGC TCT ACT TCT GAT CTT GGT CAC TGT GAA AAA ATC AAG AAG GCC TGT 388
Gly Ser Thr Ser Asp Leu Gly His Cys Glu Lys Ile Lys Lys Ala Cys
110 115 120

GGA AAT TTT GGC ATT CCA TGT GAA CTT CGA GTA ACA TCT GCG CAT AAA 436
Gly Asn Phe Gly Ile Pro Cys Glu Leu Arg Val Thr Ser Ala His Lys
125 130 135

GGA CCA GAT GAA ACT CTG AGG ATT AAA GCT GAG TAT GAA GGG GAT GGC 484
Gly Pro Asp Glu Thr Leu Arg Ile Lys Ala Glu Tyr Glu Gly Asp Gly
140 145 150

ATT CCT ACT GTA TTT GTG GCA GTG GCA GGC AGA AGT AAT GGT TTG GGA 532
Ile Pro Thr Val Phe Val Ala Val Ala Gly Arg Ser Asn Gly Leu Gly
155 160 165

CCA GTG ATG TCT GGG AAC ACT GCA TAT CCA GTT ATC AGC TGT CCT CCC 580
Pro Val Met Ser Gly Asn Thr Ala Tyr Pro Val Ile Ser Cys Pro Pro
170 175 180 185

CTC ACA CCA GAC TGG GGA GTT CAG GAT GTG TGG TCT TCT CTT CGA CTA 628
Leu Thr Pro Asp Trp Gly Val Gln Asp Val Trp Ser Ser Leu Arg Leu
190 195 200

CCC AGT GGT CTT GGC TGT TCA ACC GTA CTT TCT CCA GAA GGA TCA GCT 676
Pro Ser Gly Leu Gly Cys Ser Thr Val Leu Ser Pro Glu Gly Ser Ala
205 210 215

CAA TTT GCT GCT CAG ATA TTT GGG TTA AGC AAC CAT TTG GTA TGG AGC 724
Gln Phe Ala Ala Gln Ile Phe Gly Leu Ser Asn His Leu Val Trp Ser
220 225 230

AAA CTG CGA GCA AGC ATT TTG AAC ACA TGG ATT TCC TTG AAG CAG GCT 772
Lys Leu Arg Ala Ser Ile Leu Asn Thr Trp Ile Ser Leu Lys Gln Ala
235 240 245

GAC AAG AAA ATC AGA GAA TGT AAT TTA TAAGAAAGAA TGCCATTGAA TTTTTTA 826
Asp Lys Lys Ile Arg Glu Cys Asn Leu
250 255

GGGGAAAAAC TACAAATTTC TAATTTAGCT GAAGGAAAAT CAAGCAAGAT GAAAAGGTAA 886 

TTTTAAATTA GAGAACACAA ATAAAATGTA TTAGTGAATA AATGGTGAGG GTAGGCCTAT 946 

TCAGATGCAA GGCCAGCAAT GGGGCTCCCC ATTATCCCCA CCCCTTTGGT CCCAGTCCCC 1006 

TTCTCTGCAA TGGGCACGCA TAGAGGAGAG ACAAAGGGTA TTAGACGCAA CATCATTGGC 1066 

CCAGGGGAGT CCGAGAAGAG CTGCCATTGG CTGACAGGGC ATTTTCAGGC TCTGTCATTG 1126 

GTCAGGGAGC ACACCCCAGC CTGAAGAGTG ATGCCATTGG CCAGGGAGTG GTTTTGTCAT 1186 

AGCCGTTGGC TGTGAAGTGG AAGGAAAAGA TCTGGGAATG AAGCCCTGTG GCCAGGAAGA 1246 

TAGACAGGGC AGCAACTTCT GGGCCTCCAG GCCCTCTTCC CACCATAGCA ATGTGGGCAA 1306 

AACTGGTGTC AGGCCCCAGC CAGAAAAAGG AGCCCAAGCC AGAGGGCAAG TGACAAAGGA 1366 

TGTACCATGT CCAATCTCCC ACACCCTGGG GCTGCCCTTC CCAATGTCTT TCTTGATAGC 1426 

CAAGTTGGGC TGGGAGCAGC TCACTGCTCC TCTAGCCAGG AGGGTTTCTC AGCTCCTGGA 1486 

GGCCGCAGCT TGATGTTGAA CTCCTGCAGG GTCTGCTCCA GCTGTTTCTG GTTCCCAGCA 1546 

AAGTAGGCGG ACACAGCATT GTGGAAGAGC AGCAGCTGCT TGTGCATCAC CTTGATCTTG 1606 

TTTTCTTCCA GGAACTTGAG CTTGATGGCC ACATCTCCCC GCAGCTTCTC ATACTTGTCC 1666 

CGATGGGCCT GGAAAGTGGC CTGGGCACTC TCAAGTCGAC CACGTGTCCC TGCATCCCGG 1726 

GGGCCTAGAC TCAGCTCCTC TAAGTCTGTT CGGTAGGCAT CATATTCCAG CCTGGCAGCC 1786 

TCATACTGTT TCACAGTCAT GAGCGTGTCT TCCATGGTCT TGGTGACCAA TGTGTTGATG 1846 

CTAGAGACAA AGAAGTTCAC GGCTCCTAGC AGCGTTTCCC CATTCTTGCA TAGTAGTTTC 1906 

TGTGTCTCTG CATTGTAGCC AAATTCCTCC TGAAGCTCTG GGGACTTCTG GCTGAGGTCA 1966 

GCAAAGGCAT CACCCAGTGC ATGCTGGGTC TGCAGCAGGC TGTAGAGGTG GGCTGTCAGT 2026 

GCCCGGCCCA GCTGCAGGAC ACTCTCATAC TTGCGCTTCG TCTCACGCAG CAACTCAATC 2086 

TGCAGCTCTA GCTCCAGGAT TCCGGCGCCT CCACTCCGTC CCCCGCGGGT CTGCTCTGTG 2146 

TGCCATGGAC GGCATTGTCC CAGATATAGC CGTTGGTACA AAGCGGGGAT CTGACGAGCT 2206

TTTCTCTACT TGTGTCACTA ACGGACCGTT TATCATGAGC AGCAACTCGG CTTCTGCAGC 2266 

AAACGGAAAT GACAGCAAGA AGTTCAAAGG TGACAGCCGA AGTGCAGGCG TCCCCTCTAG 2326 

AGTGATCCAC ATCCGGAAGC TCCCCATCGA CGTCACGGAG GGGGAAGTCA TCTCCCTGGG 2386

GCTGCCCTTT GGGAAGGTCA CCAACCTCCT GATGCTGAAG GGGAAAAACC AGGCCTTCAT 2446

CGAGATGAAC ACGGAGGAGG CTGCCAATAC CATGGTGAAC TACTACACCT CGGTGACCCC 2506 

TGTGCTGCGC GGCCAGCCCA TCTACATCCA GTTCTCCAAC CACAAGGAGC TGAAGACCGA 2566 

CAGCTCTCCC AACCAGGCGC GGGCCCAGGC GGCCCTGCAG GCGGTGAACT CGGTCCAGTC 2626 

GGGGAACCTG GCCTTGGCTG CCTCGGCGGC GGCCGTGGAT GCAGGGATGG CGATGGCCGG 2686 

GCAGAGCCCC GTGCTCAGGA TCATCGTGGA GAACCTCTTC TACCCTGTGA CCCTGGATGT 2746

GCTGCACCAG ATTTTCTCCA AGTTCGGCAC AG7GTTGAAG ATCATCACCT TCACCAAGAA 2806 

CAACCAGTTC CAGGCCCTGC TGCAGTATGC GGACCCCGTG AGCGCCCAGC ACGCCAAGCT 2866 

GTCGCTGGAC GGGCAGAACA TCTACAACGC CTGCTGCACG CTGCGCATCG ACTTTTCCAA 2926 

GCTCACCAGC CTCAACGTCA AGTACAACAA TGACAAGAGC CGTGACTACC TCGTGCCGAA 2986 

TTCTTTGGAT CC 2998 

(2) INFORMATION FOR SEQ ID NO:150:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:150:

Ala Thr Gln Ala Ile Phe Glu Ile Leu Glu Lys Ser Trp Leu Pro Gln
1 5 10 15

Asn Cys Thr Leu Val Asp Met Lys Ile Glu Phe Gly Val Asp Val Thr
20 25 30

Thr Lys Glu Ile Val Leu Ala Asp Val Ile Asp Asn Asp Ser Trp Arg
35 40 45

Leu Trp Pro Ser Gly Asp Arg Ser Gln Gln Lys Asp Lys Gln Ser Tyr
50 55 60

Arg Asp Leu Lys Glu Val Thr Pro Glu Gly Leu Gln Met Val Lys Lys
65 70 75 80

Asn Phe Glu Trp Val Ala Glu Arg Val Glu Leu Leu Leu Lys Ser Glu
85 90 95

Ser Gln Cys Arg Val Val Val Leu Met Gly Ser Thr Ser Asp Leu Gly
100 105 110

His Cys Glu Lys Ile Lys Lys Ala Cys Gly Asn Phe Gly Ile Pro Cys
115 120 125

Glu Leu Arg Val Thr Ser Ala His Lys Gly Pro Asp Glu Thr Leu Arg
130 135 140

Ile Lys Ala Glu Tyr Glu Gly Asp Gly Ile Pro Thr Val Phe Val Ala
145 150 155 160

Val Ala Gly Arg Ser Asn Gly Leu Gly Pro Val Met Ser Gly Asn Thr
165 170 175

Ala Tyr Pro Val Ile Ser Cys Pro Pro Leu Thr Pro Asp Trp Gly Val
180 185 190

Gln Asp Val Trp Ser Ser Leu Arg Leu Pro Ser Gly Leu Gly Cys Ser
195 200 205

Thr Val Leu Ser Pro Glu Gly Ser Ala Gln Phe Ala Ala Gln Ile Phe
210 215 220

Gly Leu Ser Asn His Leu Val Trp Ser Lys Leu Arg Ala Ser Ile Leu
225 230 235 240

Asn Thr Trp Ile Ser Leu Lys Gln Ala Asp Lys Lys Ile Arg Glu Cys
245 250 255

Asn Leu

(2) INFORMATION FOR SEQ ID NO:151:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1038 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:151:



Ile Gln Arg Phe Gly Thr Ser Gly His Ile Met Asn Leu Gln Ala Gln
1 5 10 15

Pro Lys Ala Gln Asn Lys Arg Lys Arg Cys Leu Phe Gly Gly Gln Glu
20 25 30

Pro Ala Pro Lys Glu Gln Pro Pro Pro Leu Gln Pro Pro Gln Gln Ser
35 40 45

Ile Arg Val Lys Glu Glu Gln Tyr Leu Gly His Glu Gly Pro Gly Gly
50 55 60

Ala Val Ser Thr Ser Gln Pro Val Glu Leu Pro Pro Pro Ser Ser Leu
65 70 75 80

Ala Leu Leu Asn Ser Val Val Tyr Gly Pro Glu Arg Thr Ser Ala Ala
85 90 95

Met Leu Ser Gln Gln Val Ala Ser Val Lys Trp Pro Asn Ser Val Met
100 105 110

Ala Pro Gly Arg Gly Pro Glu Arg Gly Gly Gly Gly Gly Val Ser Asp
115 120 125

Ser Ser Trp Gln Gln Gln Pro Gly Gln Pro Pro Pro His Ser Thr Trp
130 135 140

Asn Cys His Ser Leu Ser Leu Tyr Ser Ala Thr Lys Gly Ser Pro His
145 150 155 160

Pro Gly Val Gly Val Pro Thr Tyr Tyr Asn His Pro Glu Ala Leu Lys
165 170 175

Arg Glu Lys Ala Gly Gly Pro Gln Leu Asp Arg Tyr Val Arg Pro Met
180 185 190

Met Pro Gln Lys Val Gln Leu Glu Val Gly Arg Pro Gln Ala Pro Leu
195 200 205

Asn Ser Phe His Ala Ala Lys Lys Pro Pro Asn Gln Ser Leu Pro Leu
210 215 220

Gln Pro Phe Gln Leu Ala Phe Gly His Gln Val Asn Arg Gln Val Phe
225 230 235 240

Arg Gln Gly Pro Pro Pro Pro Asn Pro Val Ala Ala Phe Pro Pro Gln
245 250 255

Lys Gln Gln Gln Gln Gln Gln Pro Gln Gln Gln Gln Gln Gln Gln Gln
260 265 270

Ala Ala Leu Pro Gln Met Pro Leu Phe Glu Asn Phe Tyr Ser Met Pro
275 280 285

Gln Gln Pro Ser Gln Gln Pro Gln Asp Phe Gly Leu Gln Pro Ala Gly
290 295 300

Pro Leu Gly Gln Ser His Leu Ala His His Ser Met Ala Pro Tyr Pro
305 310 315 320

Phe Pro Pro Asn Pro Asp Met Asn Pro Glu Leu Arg Lys Ala Leu Leu
325 330 335

Gln Asp Ser Ala Pro Gln Pro Ala Leu Pro Gln Val Gln Ile Pro Phe
340 345 350

Pro Arg Arg Ser Arg Arg Leu Ser Lys Glu Gly Ile Leu Pro Pro Ser
355 360 365

Ala Leu Asp Gly Ala Gly Thr Gln Pro Gly Gln Glu Ala Thr Gly Asn
370 375 380

Leu Phe Leu His His Trp Pro Leu Gln Gln Pro Pro Pro Gly Ser Leu
385 390 395 400

Gly Gln Pro His Pro Glu Ala Leu Gly Phe Pro Leu Glu Leu Arg Glu
405 410 415

Ser Gln Leu Leu Pro Asp Gly Glu Arg Leu Ala Pro Asn Gly Arg Glu
420 425 430

Arg Glu Ala Pro Ala Met Gly Ser Glu Glu Gly Met Arg Ala Val Ser
435 440 445

Thr Gly Asp Cys Gly Gln Val Leu Arg Gly Gly Val Ile Gln Ser Thr
450 455 460

Arg Arg Arg Arg Arg Ala Ser Gln Glu Ala Asn Leu Leu Thr Leu Ala
465 470 475 480

Gln Lys Ala Val Glu Leu Ala Ser Leu Gln Asn Ala Lys Asp Gly Ser
485 490 495

Gly Ser Glu Glu Lys Arg Lys Ser Val Leu Ala Ser Thr Thr Lys Cys
500 505 510

Gly Val Glu Phe Ser Glu Pro Ser Leu Ala Thr Lys Arg Ala Arg Glu
515 520 525

Asp Ser Gly Met Val Pro Leu Ile Ile Pro Val Ser Val Pro Val Arg
530 535 540

Thr Val Asp Pro Thr Glu Ala Ala Gln Ala Gly Gly Leu Asp Glu Asp
545 550 555 560

Gly Lys Gly Leu Glu Gln Asn Pro Ala Glu His Lys Pro Ser Val Ile
565 570 575

Val Thr Arg Arg Arg Ser Thr Arg Ile Pro Gly Thr Asp Ala Gln Ala
580 585 590

Gln Ala Glu Asp Met Asn Val Lys Leu Glu Gly Glu Pro Ser Val Arg
595 600 605



Lys Pro Lys Gln Arg Pro Arg Pro Glu Pro Leu Ile Ile Pro Thr Lys
610 615 620

Ala Gly Thr Phe Ile Ala Pro Pro Val Tyr Ser Asn Ile Thr Pro Tyr
625 630 635 640

Gln Ser His Leu Arg Ser Pro Val Arg Leu Ala Asp His Pro Ser Glu
645 650 655

Arg Ser Phe Glu Leu Pro Pro Tyr Thr Pro Pro Pro Ile Leu Ser Pro
660 665 670

Val Arg Glu Gly Ser Gly Leu Tyr Phe Asn Ala Ile Ile Ser Thr Ser
675 680 685

Thr Ile Pro Ala Pro Pro Pro Ile Thr Pro Lys Ser Ala His Arg Thr
690 695 700

Leu Leu Arg Thr Asn Ser Ala Glu Val Thr Pro Pro Val Leu Ser Val
705 710 715 720

Met Gly Glu Ala Thr Pro Val Ser Ile Glu Pro Arg Ile Asn Val Gly
725 730 735

Ser Arg Phe Gln Ala Glu Ile Pro Leu Met Arg Asp Arg Ala Leu Ala
740 745 750

Ala Ala Asp Pro His Lys Ala Asp Leu Val Trp Gln Pro Trp Glu Asp
755 760 765

Leu Glu Ser Ser Arg Glu Lys Gln Arg Gln Val Glu Asp Leu Leu Thr
770 775 780

Ala Ala Cys Ser Ser Ile Phe Pro Gly Ala Gly Thr Asn Gln Glu Leu
785 790 795 800

Ala Leu His Cys Leu His Glu Ser Arg Gly Asp Ile Leu Glu Thr Leu
805 810 815

Asn Lys Leu Leu Leu Lys Lys Pro Leu Arg Pro His Asn His Pro Leu
820 825 830

Ala Thr Tyr His Tyr Thr Gly Ser Asp Gln Trp Lys Met Ala Glu Arg
835 840 845

Lys Leu Phe Asn Lys Gly Ile Ala Ile Tyr Lys Lys Asp Phe Phe Leu
850 855 860

Val Gln Lys Leu Ile Gln Thr Lys Thr Val Ala Gln Cys Val Glu Phe
865 870 875 880

Tyr Tyr Thr Tyr Lys Lys Gln Val Lys Ile Gly Arg Asn Gly Thr Leu
885 890 895

Thr Phe Gly Asp Val Asp Thr Ser Asp Glu Lys Ser Ala Gln Glu Glu
900 905 910

Val Glu Val Asp Ile Lys Thr Ser Gln Lys Phe Pro Arg Val Pro Leu
915 920 925

Pro Arg Arg Glu Ser Pro Ser Glu Glu Arg Leu Glu Pro Lys Arg Glu
930 935 940

Val Lys Glu Pro Arg Lys Glu Gly Glu Glu Glu Val Pro Glu Ile Gln
945 950 955 960

Glu Lys Glu Glu Gln Glu Glu Gly Arg Glu Arg Ser Arg Arg Ala Ala
965 970 975

Ala Val Lys Ala Thr Gln Thr Leu Gln Ala Asn Glu Ser Ala Ser Asp
980 985 990

Ile Leu Ile Leu Arg Ser His Glu Ser Asn Ala Pro Gly Ser Ala Gly
995 1000 1005

Gly Gln Ala Ser Glu Lys Pro Arg Glu Gly Thr Gly Lys Ser Arg Arg
1010 1015 1020

Ala Leu Pro Phe Ser Glu Lys Lys Lys Lys Lys Gln Lys Ala
1025 1030 1035

(2) INFORMATION FOR SEQ ID NO:152:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:152:

Ile Arg His Glu Val Ser Phe Leu Trp Asn Thr Glu Ala Ala Cys Pro
1 5 10 15

Ile Gln Thr Thr Thr Asp Thr Asp Gln Ala Cys Ser Ile Arg Asp Pro
20 25 30

Asn Ser Gly Phe Val Phe Asn Leu Asn Pro Leu Asn Ser Ser Gln Gly
35 40 45

Tyr Asn Val Ser Gly Ile Gly Lys Ile Phe Met Phe Asn Val Cys Gly
50 55 60

Thr Met Pro Val Cys Gly Thr Ile Leu Gly Lys Pro Ala Ser Gly Cys
65 70 75 80

Glu Ala Glu Thr Gln Thr Glu Glu Leu Lys Asn Trp Lys Pro Ala Arg
85 90 95

Pro Val Gly Ile Glu Lys Ser Leu Gln Leu Ser Thr Glu Gly Phe Ile
100 105 110

Thr Leu Thr Tyr Lys Gly Pro Leu Ser Ala Lys Gly Thr Ala Asp Ala
115 120 125

Phe Ile Val Arg Phe Val Cys Asn Asp Asp Val Tyr Ser Gly Pro Leu
130 135 140

Lys Phe Leu His Gln Asp Ile Asp Ser Gly Gln Gly Ile Arg Asn Thr
145 150 155 160

Tyr Phe Glu Phe Glu Thr Ala Leu Ala Cys Val Pro Ser Pro Val Asp
165 170 175

Cys Gln Val Thr Asp Leu Ala Gly Asn Glu Tyr Asp Leu Thr Gly Leu
180 185 190

Ser Thr Val Arg Lys Pro Trp Thr Ala Val Asp Thr Ser Val Asp Gly
195 200 205

Arg Lys Arg Thr Phe Tyr Leu Ser Val Cys Asn Pro Leu Pro Tyr Ile
210 215 220

Pro Gly Cys Gin Gty Ser Ala Val Gly Ser Cys Leu Val Ser Glu Gly 
225 230 235 240 

Asn Ser Trp Asn Leu Gly Val Val Gin Met Ser Pro Gin Ala Ala Ala 

245 250 255 

Asn Gty Ser Leu Ser He Met Tyr Val Asn Gly Asp Lys Cys Gly Asn 

260 265 270 

Gin Arg Phe Ser Thr Arg He Thr Phe Glu Cys Ala Gin He Ser Gly 

275 280 285 

Ser Pro Ala Phe Gin Leu Gin Asp Gly Cys Glu Tyr Val Phe He Trp 

290 295 300 

Arg Thr Val Glu Ala Cys Pro Val Vat Arg Val Glu Gty Asp Asn Cys 
305 310 315 320 

Glu Val Lys Asp Pro Arg His Gly Ash Leu Tyr Asp Leu Lys Pro Leu 

325 330 335 

Gty Leu Asn Asp Thr He Val Ser Ala Gly Glu Tyr Thr Tyr Tyr Phe 

340 345 350 

Arg Val Cys Gly Lys Leu Ser Ser Asp Val Cys Pro Thr Ser Asp Lys 

355 360 365 

Ser Lys Vat Val Ser Ser Cys Gin Glu Lys Arg Glu Pro Gin Gly Phe 

370 375 380 

His Lys Val Ala Gly Leu Leu Thr Gin Lys Leu Thr Tyr Glu Asn Gly 
385 390 395 400 

Leu Leu Lys Met Asn Phe Thr Gly Gly Asp Thr Cys His Lys Val Tyr 

405 410 415 

Gin Arg Ser Thr Ala He Phe Phe Tyr Cys Asp Arg Gly Thr Gin Arg 

420 425 430 

Pro Val Phe Leu Lys Glu Thr Ser Asp Cys Ser Tyr Leu Phe Glu Trp 

435 440 445 

Arg Thr Gin Tyr Ala Cys Pro Pro Phe Asp Leu Thr Glu Cys Ser Phe 

450 455 460 

Lys Asp Gly Ala Gly Asn Ser Phe Asp Leu Ser Ser Leu Ser Arg Tyr 
465 470 475 480 

Ser Asp Asn Trp Glu Ala He Thr Gly Thr Gly Asp Pro Glu His Tyr 

485 490 495 

Leu lie Asn Val Cys Lys Ser Leu Ala Pro Gin Ala Gly Thr Glu Pro 

500 505 510 

Cys Pro Pro Glu Ala Ala Ala Cys Leu Leu Gly Gly Ser Lys Pro Val 

515 520 525 

Asn Leu Gly Arg Val Arg Asp Gly Pro Gin Trp Arg Asp Gty He lie 

530 535 540 

Val Leu Lys Tyr Val Asp Gly Asp Leu Cys Pro Asp Gly He Arg Lys 
545 550 555 560 

Lys Ser Thr Thr He Arg Phe Thr Cys Ser Glu Ser Gin Val Asn Ser 

565 570 575 

Arg Pro Met Phe He Ser Ala Val Glu Asp Cys Glu Tyr Thr Phe Ala 

580 585 590 

Trp Pro Thr Ala Thr Ala Cys Pro Met Lys Ser Asn Glu His Asp Asp 

595 600 605 

Cys Gin Val Thr Asn Pro Ser Thr Gly His Leu Phe Asp Leu Ser Ser 

610 615 620 

Leu Ser Gly Arg Ala Gly Phe Thr Ala Ala Tyr Ser Glu Lys Gly Leu 
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625 






630 








635 




640 


Vat 


Tyr Met Ser 


I le 


Cys 


Glv 


Glu 


Asn 


Glu Asn Cys Pro 


pro Gly 


Val 






645 










650 


655 




Glv 

7 


Ala Cys Phe 


Glv 


Gin 


Thr 


Ara 


I le 


Ser Val Glv Lvs 


Ala Asn 


Lvs 




660 










665 




670 




Arg 


Leu Arg Tyr 


Val 


ASD 


Gin 


Val 


Leu 


Gin Leu Val Tyr 


Lys Asd 


Gly 




675 








630 




6S5 






Ser 


Pro Cys Pro 


Ser 


Lvs 


Ser 


Glv 

u *• 7 


Leu 


Ser Tyr Lys Ser 


Val He 


Ser 




690 






695 






700 






Phe 


Val Cys Arg 


Pro 


Glu 


Ala 


Glv 


Pro 


Thr Asn Arg Pro 


Met Leu 


I le 


705 






710 








715 




720 


Ser 


Leu Asp Lys 


Gin 


Thr 


Cvs 


Thr 


Leu 


Phe Phe Ser Trp 


His Thr 


Pro 






725 










730 


735 




Leu 


Ala Cys G I u 


Gin 


Ala 


Thr 


Glu 


Cvs 

** 7 s * 


Ser Val Arg Asn 


Gly Ser 


Ser 




740 










7 AS 




750 




lie 


Va t Asp Leu 


Ser 


Pro 


Leu 


I le 


His 


Arg Thr Gly Gly 


Tyr Glu 


Ala 




755 








760 




765 






Tyr 


Asp Glu Ser 


Glu 


Asp 


Asp 


Ala 


Ser 


Asp Thr Asn Pro 


Asp Phe 


Tyr 




770 






775 






780 






He 


Asn I le Cys 


Gin 


Pro 


Leu 


Asn 


Pro 


Met His Gly Val 


Pro Cys 


Pro 


785 






790 








795 




800 


Ala 


Gly Ala Ala 


Val 


Cys 


Lys 


Val 


Pro 


I le Asp Gly Pro 


Pro lie 


Asp 






805 










810 


815 




He 


Gly Arg Val 


Ala 


Gly 


Pro 


Pro 


He 


Leu Asn Pro I le 


Ala Asn 


Glu 




820 










825 




830 




He 


Tyr Leu Asn 


Phe 


Glu 


Ser 


Ser 


Thr 


Pro Cys Gin Glu 


Phe Ser 


Cys 




835 








840 




845 






Lys 





















(2) INFORMATION FOR SEQ ID NO:153:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:153:






Met Ala Arg Leu Ser Arg Pro Glu Arg Pro Asp Leu Val Phe Glu Glu
1 5 10 15

Glu Asp Leu Pro Tyr Glu Glu Glu Ile Met Arg Asn Gln Phe Ser Val
20 25 30

Lys Cys Trp Leu His Tyr Ile Glu Phe Lys Gln Gly Ala Pro Lys Pro
35 40 45






Arg 


Leu 


Asn 


Gin 


Leu 




Glu 


Arg 


Ala 


Leu 


Lys 


Leu Leu 


Pro Cys 


Ser 




50 










55 










60 






Tyr Lys Leu Trp Tyr Arg Tyr Leu Lys Ala Arg Arg Ala Gln Val Lys
65 70 75 80

His Arg Cys Val Thr Asp Pro Ala Tyr Glu Asp Val Asn Asn Cys His
85 90 95

Glu Arg Ala Phe Val Phe Met His Lys Met Pro Arg Leu Trp Leu Asp
100 105 110

Tyr Cys Gln Phe Leu Met Asp Gln Gly Arg Val Thr His Thr Arg Arg
115 120 125

Thr Phe Asp Arg Ala Leu Arg Ala Leu Pro Ile Thr Gln His Ser Arg
130 135 140

Ile Trp Pro Leu Tyr Leu Arg Phe Leu Arg Ser His Pro Leu Pro Glu
145 150 155 160

Thr Ala Val Arg Gly Tyr Arg Arg Phe Leu Lys Leu Ser Pro Glu Ser
165 170 175

Ala Glu Glu Tyr Ile Glu Tyr Leu Lys Ser Ser Asp Arg Leu Asp Glu
180 185 190

Ala Ala Gln Arg Leu Ala Thr Val Val Asn Asp Glu Arg Phe Val Ser
195 200 205

Lys Ala Gly Lys Ser Asn Tyr Gln Leu Trp His Glu Leu Cys Asp Leu
210 215 220

Ile Ser Gln Asn Pro Asp Lys Val Gln Ser Leu Asn Val Asp Ala Ile
225 230 235 240

Ile Arg Gly Gly Leu Thr Arg Phe Thr Asp Gln Leu Gly Lys Leu Trp
245 250 255

U-y5 SCI LcU


A I ~ 
M I a 


Asp Tyr 


Tyr I le 


Arg 




cou 






265 


Hi g M5p Vdl 


1 yr 


UlU UlU 


Ala 1 1 a 
Ala lie 


Arg 


Cf J 










"ne inr bin 


va 1 


Phe Asp 


Ser Tyr 


A 1 a 

Aia 


290 










Aia A 1 3 Lys 


net 


uiu inr 






JUJ 




Tin 
J 1 U 






A^a % 1 «^ 1 A^n 

Asp vai Asp 


Leu 


Glu Leu 


Arg Leu 


A 1 1 

Aia 






JC.J 






Arg Arg Pro 


Leu 


Leu Leu 


Asn Ser 


Va 1 

vai 




TAn 






**a«; 


Uie \/a 1 U i c 

nis va 1 n 1 s 


r*iii 

u IU 


1 rp n 1 s 


Lys Arg 


va 1 












Arg Glu I le 


I 1 A 

1 le 


Asn Thr 


Tyr Thr 


r t * 1 


Jf u 






jO 




Phe Lys Ala 


1 nr 


Gly Lys 


Pro His 


inr 






Ton 






Phe Tyr Glu 


Asp 


Asn Gly 


1 

Gin Leu 


Asp 






z.n<; 

HUD 






Lys Ala Tnr 


Lys 


vai Asn 


Phe Lys 


r t n 

ijin 










425 


T rp Cys Gin 


Cys 


uiy uiu 


Leu Glu 


Leu 


a - **; 










Ala Leu Arg 


Leu 


Leu Arg 


Lys Ala 


inr 








A<^ 




Glu Tyr Phe 


Asp 


Gly Ser 


Glu Pro 


Val 






A7n 






Leu Lys Val 


Trp 


Ser Met 


Leu Ala 


Asp 






AftQ 

*K5-> 






pne uin ser 


inr 


Lys Ala 


Val Tyr 


Asp 




enn 
5UU 






505 


Ala Tnr Pro 


bin 


lie vai 


T 1 a Aon 

lie Asn 


Tyr 








5eU 


Lys Tyr Phe 


Glu 


Glu Ser 


Phe Lys 


A 1 ~ 

Ala 








555 




Phe Lys Trp 


Pro 


Asn Val 


Ser Asp 


He 












Dhtt I 1 a Ala 

rne lie Ala 


Arg 


iyr uiy 


Gly Arg 


Lys 






SOD 






DhA f* 1 1 ■ dm 

rne uiu uin 


A 1 a 

Aia 


Leu Asp 


Gly Cys 


Pro 










«;««; 


Twit 1 At 1 I Al | 

1 y r Leu Leu 


Tyr 


Ala C 1 r» 

Aia uin 


Leu Glu 


UIU 








OUU 




Aia net Aia 


va 1 


Tyr Glu 


Arg Ala 


Thr 


OTU 






615 




cin Tyr Asp 


Met 


Phe Asn 


lie Tyr 


1 le 


A3*; 










Gly Val Tnr 


His 


Thr Arg 


Gly lie 


Tyr 






645 






Ser Asp Glu 


Hts 


Ala Arg 


Glu Met 


Cys 




00U 






665 


Cys Lys Leu 


Gly 


Glu lie 


Asp Arg 


Ala 


675 






680 




ser Gin lie 


Cys 


Asp Pro 


Arg Thr 


Thr 


/on 
ovu 






695 




Lys Asp Phe 


Glu 


Va l Arg 


His Gly 


Asn 


7ni 










Leu Arg lie 


Arg 


Arg Ser 


Val Gin 


Ala 












Phe net Ala 


Ser 


Gin Met 


Leu Lys 


Val 












Val Ser Asp 


Leu 


Ala Pro 


Gly Gin 


Ser 


755 






760 




Leu Glu Gin 


Arg 


Ala Glu 


Gin Leu 


Ala 


770 






775 




Pro Leu Arg 


Ala 


Gin Ser 


Lys lie 


Leu 


785 




790 




Arg Glu Glu 


Leu 


Ala Glu 


Leu Ala 


Gin 






805 






Gin Leu Gly 


Glu 


Asp Glu 


Asp Glu 


Asp 




820 






825 


Glu Val Arg 


Leu 


Glu Gin 


Gin Ser 


Val 



Ser Gly His 
Thr Val Met 

Gin Phe Glu 

300 

Leu Gly Arg 

315 
Arg Phe Glu 
330 

Leu Leu Arg 

Ala Leu His 

Ala Val Gin 
380 

Leu Trp Val 

395 
Asp Ala Arg 
410 

Val Asp Asp 

Arg His Glu 

Ala Leu Pro 
460 

Gin Asn Arg 

475 
Leu Glu Glu 
490 

Arg I le Leu 

Ala Met Phe 

Tyr Glu Arg 
540 

Trp Ser Thr 

555 
Leu Glu Arg 
570 

Pro Lys Tyr 

Glu Trp Gly 

Arg Ala Val 
620 

Lys Arg Ala 

635 
Gin Lys Ala 
650 

Leu Arg Phe 

Arg Ala He 

Gly Ala Phe 
700 

Glu Asp Thr 

715 
Thr Tyr Asn 
730 

Ser Gly Ser 

Gly Met Asp 

Ala Glu Ala 
780 

Phe Val Arg 

795 
Gin Val Asn 
810 

Glu Met Asp 
Pro Ala Ala 



Phe Glu 
270 
Thr Val 
285 

Glu Ser 

Glu Glu 

Gin Leu 

Gin Asn 
350 
Gin Gly 
365 

Thr Val 

Ala Phe 

Val lie 

Leu Ala 
430 
Asn Tyr 
445 

Ala Arg 

Val Tyr 

Ser Leu 

Asp Leu 
510 
Leu Glu 
525 

Gly He 

Tyr Leu 

Ala Arg 

Ala Lys 
590 
Leu Ala 
605 

Glu Pro 

Ala Glu 

He Glu 

Ala Asp 
670 
Tyr Ser 
685 

Trp Gin 

He Lys 

Thr Gin 

Ala Thr 
750 
Asp Met 
765 

Glu Arg 

Ser Asp 

Pro Glu 

Leu Glu 
830 
Val Phe 



Lys Ala 

Arg Asp 

Met lie 

Glu Asp 
320 
I le Ser 
335 

Pro His 

Arg Pro 

Asp Pro 

Ala Lys 
400 
Leu Glu 
415 

Ser Val 

Asp Glu 

Arg Ala 

Lys Ser 
480 
Gly Thr 
495 

Arg He 

Glu His 

Ser Leu 

Thr Lys 
560 
Asp Leu 
575 

Thr Leu 

Arg His 

Ala Gin 

lie Tyr 
640 
Val Leu 
655 

Met Glu 

Phe Cys 

Thr Trp 

Glu Met 
720 
Val Asn 
735 

Gly Thr 

Lys Leu 

Asp Gin 

Ala Ser 
800 
Glu He 
815 

Pro Asn 
Gly Ser
835 840 845

Leu Lys Glu Asp 
850 

(2) INFORMATION FOR SEQ ID NO:154: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 693 amino acids 
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:154:



Met 


Phe 


Ser 


Ala 


Leu Lys 




filv £f*r A^n 


Gin 


Ala 


Pro 


Glv 










5 




10 






15 




Arg 


Aso 


Lys 


Asn 


I le Pro 


Ala Hi v 1 ai i 


Gin Ser Met 


Asn 


Gin 


Ala 


Leu 








20 




25 






30 






Gin 


Arts 


Arg 


Phe 


Ala Lys 


fi 1 v Va I Gin 

ULJr Val Ulll 


Tvr Acn Met 
i y i nail rici 


Lvc 


I le 


Val 


I le 






35 






40 




45 








A rn 


Gl V 


Msp 


Ara 

mi a 


Asn Thr 


f» I v 1 vc Th r 

wljr 7 III! 


A 1 a 1 Trn 


Hi s 


A ro 
mi y 


Leu 


Gin 




50 








55 


60 










fit V 

ui y 


Arg 


Pro 


Phe 


Val Glu 


Glu Tvr I 1 e 


Pro Thr* HI r\ 

r 1 \J .1111 Ulll 


Glu 


I le 


Gin 


Val 


65 








70 




75 








80 


Thr 


Ser 


I le 


His 




Tvr 1 vc Thr 
i y i L. y ^ iiii 


Thr Acn Acn 

1 III ndJJ n*i^ 


lie 


Val 


1 vc 
uys 


Val 














90 










Glu 


Val 


Trn 

i rp 


Asp 


Va I Va I 


nsp Lyb uiy 


L.ys v<yb i-"^ 


Lys 


Arg 


fl 1 V 

uiy 


sp 








100 




105 






1 10 






uiy 


Leu 


1 vc 

Lys 


Mat- 
net 


ulu Man 


A en Di*a fl 1 r> 
nSp rru lain 


UlU A act u I U 


Cap 


ulu 


Met 

net 


At a 
Mia 






IIS 






1 C\l 




IP'S 
i til 








Leu 


HSp 


Aia 


til ** 


Phe Leu 


Asp Val Tyr 


Lys Asn Cys 


Asn 


fl 1 vy 

uiy 


W a | 

va i 


va I 




IJU 




















Mat 


net 


rne 


Asp 


lie inr 


Lys uin i rp 


i nr rne asm 


T V /r 

iyr 


i ie 


Leu 


Arg 


145 








isn 




ICC 








1 An 


ulU 


Leu 


Pro 


Lys 


Vol rru 


Thr Uic Ual 

tnr n i s va i 


rro .vai Lys 


va i 


Leu 


uiy 


Asn 










103 




t IV 






17*; 




T yr 


Arg 


Asp 


ncl 


ffly ft! It 

uiy uiu 


Hie Ara Val 
n i s Mrg wai 


lie LcU ri Q 


ASp 


Asp 


Vol 


Arg 








IOU 




1 OJ 






ion 






A 


Phe 


He 


Asp 


Asn Leu 


Acn Ara Dm 


Prn ft 1 v Cpp 

r 1 V Uljr SCI 


Ser 


Tyr 


Phe 


Arg 












200 












i yi 


Ala 


Glu 


Ser 


Ser Met 


1 vc Acn Q fi p 
kjra nail wCi 


Ohp fi 1 \f 1 ail 
rllv Uljr LCU 


Lys 


_ 

i yr 


eu 


His 




210 








215 


220 










i 

Lys 


Phe 


Phe 


sn 


1 1 e pro 


rlic Leu Ulll 


1 oil rl n Apn 

lcu uiii mi y 


Glu 


Thr 


Leu 


Leu 
























Arg 


uin 


Leu 


Ulu 


Thr Asn 


Gin Leu Asp 


Mat- Am Ala 

net Asp Ala 


Th r* 

i nr 


Leu 


rin 
u iu 


bill 
























Leu Ser Val Gln Gln Glu Thr Glu Asp Gln Asn Tyr Gly Ile Phe Leu
260 265 270

Glu Met Met Glu Ala Arg Ser Arg Gly His Ala Ser Pro Leu Ala Ala
275 280 285

Asn Gly Gln Ser Pro Ser Pro Gly Ser Gln Ser Pro Val Leu Pro Ala
290 295 300

Pro Ala Val Ser Thr Gly Ser Ser Ser Pro Gly Thr Pro Gln Pro Ala
305 310 315 320

Pro Gln Leu Pro Leu Asn Ala Ala Pro Pro Ser Ser Val Pro Pro Val
325 330 335

Pro Pro Ser Glu Ala Leu Pro Pro Pro Ala Cys Pro Ser Ala Pro Ala
340 345 350

Pro Arg Arg Ser Ile Ile Ser Arg Leu Phe Gly Thr Ser Pro Ala Thr
355 360 365

Glu Ala Ala Pro Pro Pro Pro Glu Pro Val Pro Ala Ala Gln Gly Pro
370 375 380

Ala Thr Val Gln Ser Val Glu Asp Phe Val Pro Asp Asp Arg Leu Asp
385 390 395 400

Arg Ser Phe Leu Glu Asp Thr Thr Pro Ala Arg Asp Glu Lys Lys Val
405 410 415

Gly Ala Lys Ala Ala Gln Gln Asp Ser Asp Ser Asp Gly Glu Ala Leu
420 425 430

Gly Gly Asn Pro Met Val Ala Gly Phe Gln Asp Asp Val Asp Leu Glu
435 440 445

Asp Gln Pro Arg Gly Ser Pro Pro Leu Pro Ala Gly Pro Val Pro Ser
450 455 460











WO 99/58559 



46 



PCT/US99/10793 



Gln Asp Ile Thr Leu Ser Ser Glu Glu Glu Ala Glu Val Ala Ala Pro
465 470 475 480

Thr Lys Gly Pro Ala Pro Ala Pro Gln Gln Cys Ser Glu Pro Glu Thr
485 490 495

Lys Trp Ser Ser Ile Pro Ala Ser Lys Pro Arg Arg Gly Thr Ala Pro
500 505 510

Thr Arg Thr Ala Ala Pro Pro Trp Pro Gly Gly Val Ser Val Arg Thr
515 520 525

Gly Pro Glu Lys Arg Ser Ser Thr Arg Pro Pro Ala Glu Met Glu Pro
530 535 540

Gly Lys Gly Glu Gln Ala Ser Ser Ser Glu Ser Asp Pro Glu Gly Pro
545 550 555 560

Ile Ala Ala Gln Met Leu Ser Phe Val Met Asp Asp Pro Asp Phe Glu
565 570 575

Ser Glu Gly Ser Asp Thr Gln Arg Arg Ala Asp Asp Phe Pro Val Arg
580 585 590

Asp Asp Pro Ser Asp Val Thr Asp Glu Asp Glu Gly Pro Ala Glu Pro
595 600 605

Pro Pro Pro Pro Lys Leu Pro Leu Pro Ala Phe Arg Leu Lys Asn Asp
610 615 620

Ser Asp Leu Phe Gly Leu Gly Leu Glu Glu Ala Gly Pro Lys Glu Ser
625 630 635 640

Ser Glu Glu Gly Lys Glu Gly Lys Thr Pro Ser Lys Glu Lys Lys Lys
645 650 655

Lys Thr Lys Ser Phe Ser Arg Val Leu Leu Glu Arg Pro Arg Ala His
660 665 670

Arg Phe Ser Thr Arg Val Gly Tyr Gln Val Ser Val Pro Asn Ser Pro
675 680 685

Tyr Ser Glu Ser Tyr
690



